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The National Study of CMO Effectiveness is a longitudinal research effort designed to 
measure how nonprofit charter school management organizations (CMOs) affect student 
achievement and to examine the internal stmctures, practices, and policy contexts that may 
influence these outcomes. The study began in May 2008 and wiU conclude in 2012. 

This report presents findings on CMO students, resources, and practices as well as CMO 
impacts on student achievement in middle school. It also examines the relationships between 
CMO practices and impacts. A subsequent version of this report will include findings on CMO 
impacts on high school graduation and postsecondary enrollment. 

The study is being conducted by Mathematica PoUcy Research and the University of 
Washington’s Center on Reinventing Public Education (CRPE). It was commissioned by 
NewSchools Venture Fund, with the generous support of the Bill & Melinda Gates Foundation 
and the Walton Family Foundation. 

Mathematica Policy Research ( www.mathematica-mpr.com) seeks to improve public well- 
being by conducting studies and assisting clients with program evaluation and policy research, 
survey design and data collection, research assessment and interpretation, and program 
performance/ data management. Its clients include foundations, federal and state governments, 
and private-sector and international organizations. The employee-owned company, with offices in 
Princeton, NJ, Ann Arbor, MI, Cambridge, MA, Chicago, IL, Oakland, CA, and Washington, DC, 
has conducted some of the most important studies of health care, international, disability, 
education, family support, employment, nutrition, and early childhood policies and programs. 

The Center on Reinventing Public Education at the University of Washington engages in 
independent research and policy analysis on a range of K-12 public education reform issues, 
including choice & charters, finance & productivity, teachers, urban district reform, leadership, 
and state & federal reform. CRPE’s work is based on two premises: that public schools should be 
measured against the goal of educating aU children well, and that current institutions too often fail 
to achieve this goal. Our research uses evidence from the field and lessons learned from other 
sectors to understand complicated problems and to design innovative and practical solutions for 
policymakers, elected officials, parents, educators, and community leaders. 
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EXECUTIVE SUMMARY 

Charter schools — pubUc schools of choice that are operated autonomously, outside the direct 
control of local school districts — have become more prevalent over the past two decades. There is 
no consensus about whether, on average, charter schools are doing better or worse than 
conventional public schools at promoting the achievement of their students. Nonetheless, one 
research finding is clear: Effects vary widely among different charter schools. Many educators, 
policymakers, and funders are interested in ways to identify and replicate successful charter schools 
and help other public schools adopt effective charter school practices. 

Charter-school management organizations (CMOs), which establish and operate multiple 
charter schools, represent one prominent attempt to bring high performance to scale. Many CMOs 
were created in order to replicate educational approaches that appeared to be effective, particularly 
among disadvantaged students. Attracting substantial philanthropic support, CMO schools have 
grown rapidly from encompassing about 6 percent of all charter schools in 2000 to about 17 percent 
of a much larger number of charter schools by 2009 (Miron 2010). Some of these organizations have 
received laudatory attention through anecdotal reports of dramatic achievement results. 

The National Study of CMO Effectiveness aims to fill the gap in systematic evidence about 
CMOs, providing the first rigorous nationwide examination of CMO achievement effects. The study 
includes an examination of the relationships between the practices of individual CMOs and their 
effects on student achievement, with the aim of providing useful guidance to the field. Mathematica 
Policy Research and the Center on Reinventing Public Education (CRPE) are conducting the study 
with funding from the Bill & Melinda Gates Foundation and the Walton Family Foundation, and 
project management assistance from the NewSchools Venture Fund. This report provides key 
findings from the study on CMO practices, impacts, and the relationships between them. Additional 
reports will explore promising practices in greater depth and examine longer-term impacts of CMOs 
on high school graduation and college entry. 

A. Research Questions, CMOs in Study, and Data Sources 

This study uses multiple data sources to describe CMOs, assess their impacts on students, and 
identify practices associated with positive impacts in order to address the following research 
questions: 

1. Characteristics and Context. Flow quickly are CMOs growing? Which students and 
areas do they serve and what resources do they use? What are the practices and 
structures of CMOs? What state policies and other factors appear to influence the 
location and growth of CMOs? 

2. Impacts. What are the impacts of CMOs on student outcomes and to what extent do 
these impacts vary across CMOs? 

3. Promising Practices. Which CMO practices and structures are positively associated 
with impacts? 

Previous studies have defined CMOs in various ways. Our study focuses primarily on nonprofit 
CMOs that directly control four or more schools. To be eligible for the study, an organization had 
to satisfy four criteria: it had to (1) have four schools open by fall 2007, (2) be nonprofit since 
inception, (3) not primarily serve dropouts or similar special populations, and (4) directly manage 


xxi 


Executive Summary 


schools (that is, be able to hire and fire school principals). Across the United States, we identified 40 
CMOs with 292 schools that satisfied the four criteria. These CMOs schools are located in 14 states, 
with the largest concentrations in Texas, California, Arizona, Ohio, Illinois, New York, and the 
District of Columbia. 

To examine eligible CMOs and address the research questions, we conducted a survey of CMO 
central office staff, surveys of CMO principals and principals in nearby conventional public schools, 
a survey of CMO teachers, and site visits to 10 CMOs and 20 schools. In addition, we collected and 
analyzed school records with data on student characteristics and outcomes (including test scores), 
and we examined CMO financial records and business plans. 

B. Areas and Students Served and Resources Used by CMOs 

1. CMO-mn schools now represent a substantial share of charter schools and are 

concentrated in certain states and urban areas 

Broadly defined, there are approximately 130 CMOs in the United States, serving nearly 
250,000 students.' CMOs represent approximately 20 percent of the approximately 5,000 charter 
schools operating nationally, up from 12 percent in 1999. Between 1999 and 2009, the number of 
CMO schools increased by approximately 20 percent per year, substantially more than the rate for 
independent charter schools (Figure 1). In 2009, growth in CMO schools appeared to slow to a rate 
comparable to that of independent charter schools. The growth trajectory of individual CMOs 
varies according to mission, capacity and constraints imposed by states and charter authorizers. On 
average, CMOs in our study had opened about one new school per year for the first six years. After 
seven years of operation, the average pace picks up to approximately two new schools per year. 

Nationally, the growth of CMO schools is concentrated in a handful of states. About 80 percent 
of aU CMO-mn schools operate in Texas, California, Arizona, and Ohio. Most CMOs have located 
in states where the charter law offers moderate to high levels of autonomy to charter operators. 

Figure 1. Growth of CMO Schools 


25% 


20% 

1 5 % Total CMO Charter 

Schools 

10% ^—Percent of All 

Charter Schools 
5% 


0 % 


Source: National Alliance for Public Charter Schools. 


' These counts employ a broad definition of CMO — namely any nonprofit organization managing two or more 
schools. By contrast, our study focuses on the 40 CMOs that were managing four or more schools In 2007. 
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CMO schools are also concentrated in urban areas. About 74 percent of all CMO schools 
eligible for our study are located in cities. In some cities, such as New Orleans, Newark, Los 
Angeles, and Chicago, CMOs now represent a substantial fraction of all charter schools. Some 
CMOs gravitate to urban centers because they are large enough to allow for the creation of a 
concentrated network of schools, making it easier and more economical for CMO staff to support 
them. And many CMOs have sought to focus on assisting low-income students and have opened 
schools in areas where they hope to attract these students. 

2. Relative to their host districts, CMOs serve a disproportionately large number of black, 

Hispanic, and low-income students but fewer special needs students 

Compared to their host districts, the middle school student population served by the average 
CMO in our study includes a greater percentage of minority and low-income students (Table 1). In 
the average CMO, approximately 91 percent of middle school students are black or Hispanic 
compared to 76 percent of their host districts’ middle school students. For the average CMO in our 
sample, approximately 7 1 percent of middle school students qualify for free or reduced-price lunch, 
compared to 64 percent of those served by their host districts. On the other hand, CMOs in this 
study serve a smaller share of students with disabilities and limited English proficiency. 

Most CMOs attract students whose previous average test scores are similar to local averages but 
higher than local averages for black and Hispanic. Most CMOs — 18 of 22 in our middle-school 
sample — enroll students whose prior achievement levels are within a quarter of a standard deviation 
of the overall average for their districts. But the black and Hispanic students entering most CMOs 
tend to have higher average test scores than their black and Hispanic peers in other public schools. 
In 13 of the 22 CMOs black students have significandy larger average math baseline test scores than 
the average for black students in the host district; only 1 of the 22 CMOs served black students with 
significandy lower average math test scores than their district peers. The patterns are similar for 
Hispanics and also similar for reading test scores. 


Table 1. CMOs serve largely low- income, minority students, but English-language learners and 
special education students are somewhat under- represented 



Percentage Black 
or Hispanic 

Percentage Free or 
Reduced-Price 
Lunch 

Percentage Limited 
English Proficient 

Percentage with 
lEP 

CMO Average 

91% 

71% 

14% 

9% 

Host District 

Average 

76% 

64% 

19% 

13% 


Source: State and district school records. 


lEP = individualized education plan. 

3. Pet-pupil expenditures in CMOs vary widely, along with public funding formulas 

The CMOs in our sample spent between $5,000 and $20,000 per pupil each year. This variation 
appeared to be partiy driven by the per-pupil funding amounts determined by state public funding 
formulas: the correlation between per-pupil expenditure and per-pupil state funding is 0.61. In 
addition, philanthropy probably contributed to the variation across CMOs in per-pupil expenditures. 

CMOs use their resources to support organizational stmctures and functions similar to those of 
school districts, but they vary widely in how they allocate their staff across various functions as well 
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as in the ratio of central office staff to number of students.. Some CMOs invest heavily in large 
central offices, while others maintain a minimal level of administrative staff. CMOs Many CMOs 
also allocate staff to support the creation of new schools. 

4. On average, CMOs schools tend to be much smaller than schools in their host districts, 
with marginally lower student-teacher ratios 

Because their organizational mission is often focused on personalized learning and a strong, 
intimate school culture, one might expect CMOs to have smaller school and class sizes. Indeed, 
CMO schools are much smaller than nearby schools in their districts. The CMOs in our study 
average 389 students per school compared to 982 students for nearby district schools. This 
difference may shrink somewhat as CMO schools grow to full capacity. Class sizes and pupil-to- 
instmctor ratios are also smaller in CMO schools than in their host districts. The average pupil-to- 
instmctor ratios in math and reading are about 20.9 students per instructor; by contrast, in 
comparison schools the ratios are 23.5 in math and 23.2 in reading. (The difference between these 
CMO and district ratios is statistically significant in math but not in reading). 

C. CMO Practices 

1. CMOs are less likely than districts to prescribe a particular curriculum or student 
behavior policy, but CMO principals report more often than district principals that they 
implement a school-wide behavior strategy 

In theory, CMOs could either provide principals with substantial discretion in selecting 
curricula and instmctional materials or require all schools to adopt the same curriculum. We found 
that, relative to districts, CMOs are less likely to mandate a specific curriculum. 

We also compared CMO schools and district schools on student behavior policy. CMO 
principals are significantly more likely than district principals to report that their schools have 
school-wide behavior standards (95 percent vs. 76 percent) and require students or parents to sign 
responsibility agreements (74 percent vs. 54 percent). But they also report more flexibility than 
district principals in defining the details of all behavior policies. 

2. Relative to districts, CMO principals report that their teachers receive more coaching 
and that their teachers’ pay is more likely to be based on performance 

Some CMOs appear to pay considerable attention to the support and evaluation of teachers. 
Principal surveys suggest that CMO schools engage in more intense monitoring and coaching of 
teachers relative to district schools. On average, CMO principals report more frequent observations 
of teachers by administrators see Figure 2), more frequent feedback to teachers from individuals 
observing them, and more frequent submission of lesson plans by teachers for review. 

CMO schools are also more likely than district schools to use performance-based 
compensation. On average, 69 percent of CMO principals report using student test scores to 
evaluate teachers, compared to 46 percent of principals in nearby district schools. In addition, CMO 
principals report placing a higher priority on student test scores and teacher observations than on 
tenure and education in determining teachers’ pay. 
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Figure 2. CMO Administrators Observe Teachers More Often 



times times 


■ CMO 

■ District 


Mean Annual Frequency of Observations of Teachers by Administrators 


Source: Principal survey. 

3. Relative to district principals, CMO principals report that their schools provide more 
instructional time 


Some CMOs view increased instructional time as a key strategy for promoting achievement. 
Relative to district schools, CMOs tend to require significandy more time in school. The principal 
survey indicates that the average CMO provides 1,373 hours of instruction time per year compared 
to 1,239 hours in the nearby district schools. About 40 percent of CMOs report more than 1,400 
instruction hours annually, and none of the district schools have this much (see Figure 3). The 
difference is driven largely by the length of the school day (rather than days in the school year), 
averaging 7.5 hours in the CMOs and 6.9 hours in district schools. 


Figure 3. CMOs Offer More Annual Instructional Time 
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4. CMOs can be categorized into four clusters that are differentiated largely by their 
emphasis on schoolwide behavior policies, teacher coaching, instructional time, 
formative assessment use, and performance based compensation 

We used cluster analysis to categorize CMOs into four groups defined by their practices. One 
group (“Data Driven”) is distinguished by its emphasis on performance-based compensation and 
use of formative assessment data. A second group (“Time on Task”) places greatest emphasis on 
school-wide behavior policies and requires the most instructional hours for its students. Both of the 
first two groups also make extensive use of teacher coaching. A third set of CMOs (“Incremental 
Innovation”) deviates the least from practices typical of conventional, district-operated schools. The 
fourth group (“Alternative Approach”) is made up of a pair of CMOs that place the least emphasis 
on aU of these practices. 

D. CMO Impacts on Student Achievement 

To estimate the effects of CMOs on student achievement, we examined the gains in test scores 
of individual students from before they entered CMO schools until up to three years after entry, as 
compared to gains of a matched comparison group of students who resembled the CMO students in 
terms of baseline test scores and other key characteristics. Students who transferred out of CMO 
schools after the first year enrolled were kept in the CMO “treatment” group for the analysis. This 
ensures that impact estimates are not artificially inflated by the departure of low-scoring students. It 
also means that our impact estimates are conservative, in the sense that students who remain 
enrolled in CMOs for more than a year are likely to experience larger impacts than what we report 
here. 

Pre-CMO test scores are a key element of the matched comparison design, which means that 
elementary schools cannot be included, because pre-kindergarten students do not typically have test 
scores. The next version of this report will examine longer-term effects of CMO high schools on 
graduation and college entry; the current impact analysis focuses on CMOs with middle school 
grades, where test coverage is more complete and consistent than at any other grade level. 
Specifically, we were able to estimate impacts for 22 of the 26 CMOs that were eligible for this study 
and operate middle schools. 

We tested the validity of our “propensity-score matching” method in a subset of CMO schools 
where it was possible to conduct a randomized experiment — the “gold standard” of evaluation 
methodology. The matching approach successfully replicated experimental results, thereby providing 
some confidence that it can produce valid impact estimates in the much larger number of CMO 
schools where it can be applied but where the experimental analysis is not possible. We report 
impacts in standard deviation units (also known as z-scores) to allow comparisons across grades and 
states that have different test scales. 

1. Test score impact estimates for the average CMO after two to three years in middle 
school are positive in all four subjects, but they are not statistically significant 

We estimated impacts of the average CMO in reading, math, science and social studies, one to 
three years after a student’s initial enrollment in the CMO school (though science and social studies 
test scores were available only for a subset of CMOs and years). Average CMO impacts are positive 
in all cases but one (one-year reading impact), but they are not statistically significant (at the .05 
level), despite reaching a non- trivial magnitude in math by the third year after enrollment. (Table 2). 
Our statistical power to detect an average impact across CMOs is limited by the fact that only 22 
CMOs are included in the analysis (fewer for science and social studies). 
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Table 2. Average CMO Test Score Impacts, by Year After CMO Enrollment 



1 -Year impact 

2-Year impact 

3-Year impact 

Math 

0.06 

0.1 r 

0.15 


(0.05) 

(0.06) 

(0.09) 

Number of CMOs 

22 

22 

14 

Reading 

-0.01 

0.03 

0.05 


(0.02) 

(0.03) 

(0.04) 

Number of CMOs 

22 

22 

20 

Science 

N.A. 

N.A. 

0.06 




(0.09) 

Number of CMOs 



1 1 

Social Studies 
Number of CMOs 

N.A. 

N.A. 

0.09 

(0.09) 

9 


Source: State, district, and CMO school records. 

"Significantly different from zero at the .1 0 level, two-tailed test. 

2. Achievement impacts for individual CMOs are more often positive than negative, but 
vary substantially in both directions 

The remainder of our analyses focus primarily on the cumulative impacts of individual CMOs 
on students two years after enrollment. We focus on two-year impact estimates because two years is 
the longest period we can examine all 22 CMOs for which impacts can be estimated. 

The overall average impacts mask a great deal of variation among CMOs. Two years after 
students enroll in the CMOs covered by the impact analysis, they experience significantly positive 
math impacts in half of these CMOs (11 of 22), while students in about one-third of the CMOs (7 of 
22) do significantly worse in math. Similarly, students in nearly half of the CMOs (10 of 22) 
experience significantly positive impacts in reading, while students in about a quarter of CMOs (6 of 
22) experience reading impacts that are significandy negative. Table 3 shows that half of the CMOs 
(1 1 of 22) have significandy positive impacts in math or reading and nine have significandy negative 
impacts in one or both subjects; 10 of the 22 CMOs have significantly positive impacts in both 
subjects while only four have significandy negative impacts in both subjects. 


Table 3. Number of CMOs With Positive and Negative Impacts in Math and Reading 




Math Impacts 


Reading Impacts 

Significant Positive 

insignificant 

Significant Negative 

Significant Positive 

10 

0 

0 

insignificant 

1 

2 

3 

Significant Negative 

0 

2 

4 


Source: State, district, and CMO school records. 
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There is also substantial variation in the magnitude of impacts. Figure 4 shows the distribution 
of estimated two-year math and reading impacts respectively on the x and y-axes. The size of the 
bubbles in Figure 4 represents the number of schools each CMO operated in fall 2009. Two years 
after CMO enrollment, math impacts range between -0.30 and 0.63 while reading impacts range 
between -0.22 and 0.24. The majority of two-year impacts are statistically significant (18 out of 22 
for math; and 16 out of 22 for reading). At the extremes, the impacts are substantial: A few CMOs 
produce impacts that might generate three years of learning gains within two years of enrollment 
(Bloom et al. 2008). These numbers suggest that the CMOs at the high end of the scale have the 
potential to measurably reduce achievement gaps, especially in math. Meanwhile, the lowest 
performing CMOs are producing negative achievement effects that are nearly as large as the effect 
of a year of schooling — that is, their students achieve not much more than one year of learning after 
two years in the classroom. 


Figure 4. Test- Score Impacts in Math and Reading Vary Considerably Across CMOs 
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Source: State, district, and CMO school records. 


As indicated in Figure 4, most of the larger CMOs in our sample have positive impacts, which 
might indicate that funders have had some success in supporting the expansion of CMOs that are 
more effective. Among the CMOs covered by the impact analysis, 8 of the 12 larger ones (those 
operating more than 8 schools in 2009-10) have significant positive impacts in at least one subject; 
by contrast, this is the case for only 3 of the 10 small CMOs (those operating 8 or fewer schools in 
2009-10). CMOs that have significant positive impacts in both reading and math operate an average 
of approximately 12 schools, while those with negative impacts in both subjects operate an average 
of approximately 6 schools. Despite this pattern, effectiveness is not related to size in a linear way: 
Correlations between size and impacts (and between growth rate and impacts) are not statistically 
significant. 
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Although the larger CMOs often have positive impacts, this does not mean that CMOs increase 
their performance as they grow. Within individual CMOs, some show declining impacts as they add 
schools, while others do not. In math, there is no clear pattern of changes in impacts as CMOs 
grow, but in reading, the impacts of most CMOs declined as they grew. 

3. CMOs that produce positive impacts in one subject tend to produce positive impacts in 
other subjects, including science, social studies, reading, and math 

The upward-sloping diagonal pattern of the results in Figure 4 shows that CMOs’ math and 
reading impacts are positively correlated. Similar patterns are evident for science and social studies. 
Within CMOs, impacts are highly correlated across the four subjects. Science and social studies 
impacts, like reading and math impacts, vary substantially across CMOs. But we found no evidence 
that CMOs are focusing on some subjects at the expense of others: 

4. In several CMOs, math and reading test score impacts are larger for Hispanic students 
than for other students 

Prompted by prior studies of charter schools (Angrist et al. 2011; Gleason et al. 2010) that have 
found suggestive evidence of greater benefits for low-income minority students in urban areas, we 
compared two-year math and reading impacts for particular subgroups of students. We found some 
evidence of larger two-year math and reading impacts for Hispanic students in the nine CMOs for 
which sufficient data was available. Other subgroups — including African Americans and groups 
defined by gender, income, and prior academic achievement — do not show clear patterns of 
differential positive or negative impacts. 

E. Practices Associated with Positive Impacts 

Understanding which CMO practices are associated with the largest impacts can help identify 
potentially promising educational strategies. To be sure, the associations we observed between 
impacts and specific CMO practices might not indicate a causal effect of the practices. It is possible 
that a practice that is positively associated with impacts may in fact be correlated with some other 
practices we do not observe that are the real driver of student outcomes. But examining associations 
of practices with impacts is the necessary first step toward identifying promising practices. 

1. Comprehensive behavior policies in schools are associated with larger CMO impacts 

Student impacts in math and reading are larger in CMOs whose schools have comprehensive 
behavior policies. We found positive associations between student impacts and multiple measures of 
school behavior policies: consistent behavior standards and disciplinary policies within a school, 
zero tolerance policies for potentially dangerous behaviors, behavior codes with student rewards and 
sanctions, and responsibility agreements signed by students or parents. 

2. CMOs with intensive coaching of teachers tend to have larger positive impacts on 
student achievement 

Student impacts in math and reading are larger in CMOs with schools that place a greater 
emphasis on intensive coaching of new teachers. Impacts are associated with a composite measure 
of teacher coaching that captures the frequency with which teachers are observed and the frequency 
with which they receive feedback on their performance and their lesson plans. In addition impacts 
are larger in those CMOs providing substantial professional development support to their schools. 
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3. Several other notable CMO-level characteristics do not show significant relationships 

with impacts 

We found no significant relationship between impacts and three other factors that we posited 
might contribute to student achievement. Specifically, impacts are not correlated with (1) the extent 
to which CMOs define a consistent educational approach through the selection of curricula and 
instructional materials, (2) performance-based teacher compensation, or (3) frequent formative 
student assessments (although impacts are larger when teachers frequendy use student test results to 
modify lesson plans). Nor are impacts significandy associated with school or class sizes. 

Math impacts are positively correlated with more hours of annual instmction, but this 
relationship appears to be largely due to the association of instmctional time with behavior policies 
and coaching. We ran multivariate regressions of impacts on key practices that were significantly 
associated with impacts in bivariate regressions. In the multivariate regressions, the association 
between impacts and instructional time declined substantially and became not statistically significant. 

F. Questions for Future Research 

As is often the case in studies of this kind, some of the interesting findings raise other 
important questions for the next report from this study and for other research on CMOs: 

• To what extent do CMOs produce positive effects on other student outcomes 
aside from academic achievement as measured by test scores? A subsequent 
version of this report will explore the extent to which some CMOs have an impact on 
two long-term outcomes — high school graduation and college enrollment. 

• Might some CMOs have selected the wrong models to replicate or had difficulty 
replicating promising school models? Over 40 percent of the CMOs covered by our 
impact analysis are falling short of the performance of nearby district schools in math or 
reading. This raises questions about whether some CMOs might be scaling up the wrong 
models, or are attempting to scale up a promising model but have had difficulty 
replicating it. 

• Which promising strategies should CMOs implement and how should they 
implement them? Our forthcoming report on promising practices of CMOs will 
explore this issue in more depth, drawing on both our surveys and qualitative research. 

• To what extent are some CMOs able to consistently improve student outcomes 
across their schools? We hope to explore this issue by estimating school-level impacts 
and examining the variability of impacts within CMOs. 

• To what extent do CMOs add value compared to independent charter schools? 

Whether CMOs can take advantage of scale without losing the advantages of charter 
status (that is, becoming indistinguishable from school districts) is a key question. 

• Are new CMOs using the same strategies and producing the same impacts as the 
established CMOs in our study? Because our study began four years ago, we focused 
on CMOs operating in 2007. It is possible that newer CMOs are either more or less 
effective than the CMOs we examined. 
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• What other factors might contribute to CMO impacts? Among the other important 
hypotheses that we could not examine in detail are ways in which impacts might be 
related to high expectations in the classroom, funding levels, peer effects, grade 
configuration, and specific approaches to classroom instruction. 
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I. INTRODUCTION 

A. Policy Context and Rationale for CMOs 

Charter schools — pubUc schools of choice that are operated autonomously, outside the direct 
control of local school districts — have become more prevalent over the past two decades. Several 
thousand across the country are educating students, and federal policy has supported their growth 
over three successive presidential administrations. Research studies on charter schools have 
multiplied along with the schools themselves, and an increasing number of studies, using a variety of 
methods, have attempted to measure charter schools’ effects on student achievement. Debates over 
the achievement impacts of charter schools have been heated, but no clear consensus has emerged 
about whether, on average, charter schools across the country are doing better or worse than 
conventional public schools at promoting student achievement. Nonetheless, one finding of the 
research on charter school impacts is clear: Effects vary widely among different charter schools. 

Large variation in the performance of charter schools follows direcdy from the design of the 
pohcy itself. A major purpose of charter school laws is to encourage the creation of varied 
educational models. The question for current and future funders and operators is how to identify 
and replicate the more successful charter schools and their effective practices. 

Charter school management organizations (CMOs) — ^which, as the name implies, operate 
multiple charter schools and create new schools under a common structure and philosophy — 
represent one prominent attempt to leverage the success of high-performing charter schools. Many 
CMOs were created to replicate educational approaches that appeared to be effective, particularly 
for disadvantaged students, in a small number of charter or other schools. Attracting substantial 
philanthropic support, CMO schools have grown rapidly from encompassing about 6 percent of all 
charter schools in 2000 to about 17 percent of a much larger number of charter schools by 2009 
(Miron 2010). Some of these organizations have received laudatory attention following reports of 
dramatic achievement results for disadvantaged students. Many of these reports, however, have 
relied on incomplete evidence. 

The National Study of CMO Effectiveness aims to fill the gap in evidence about CMOs, 
providing the first rigorous nationwide examination of the effects of CMO schools on student 
achievement. To provide information on the variability of CMO effectiveness, the study estimates 
achievement effects separately for each CMO for which data were available. In addition, we seek to 
understand the relationships between CMO practices and their effects on student achievement; the 
aim is to inform the field by identifying promising practices associated with positive impacts. 
Mathematica Policy Research and the Center on Reinventing Public Education (CRPE) have 
conducted the study with funding from the Bill & Melinda Gates Foundation and the Walton Family 
Foundation and with project management assistance from the NewSchools Venture Fund. 

This report provides key findings on CMO practices, impacts, and the relationships between 
them. Future reports will explore promising practices in greater depth and examine longer-term 
impacts of CMOs on high school graduation and college entry. 
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B. Research Questions, CMOs in Study, and Data Sources 

The National Study of CMO Effectiveness is designed to describe CMOs in existence in 2007, 
assess their impacts on students, and identify practices associated with positive impacts. Drawing on 
multiple data sources, the study focuses on three sets of research questions: 

1. Characteristics and Context. How quickly are CMOs growing? Which students and 
areas do they serve, and what resources do they use? What are the practices and 
structure of CMOs? What district policies and other contextual factors appear to 
influence the location and growth of CMOs? 

2. Impacts. What are the impacts of CMOs on student achievement and attainment and to 
what extent do these impacts vary across CMOs? 

3. Promising Practices. Which CMO practices and structures are positively associated 
with impacts? 

1. CMOs Eligible for this Study 

Previous studies have defined CMOs in various ways, and these differing definitions have 
influenced which organizations the studies covered. Some researchers include organizations that do 
not directly manage charter schools but that provide a brand name, support network, and other 
academic services; other researchers include organizations whose services are limited to 
administrative functions such as payroll. Although some studies focus only on not-for-profit 
organizations, others have also examined for-profit organizations. Studies also vary in terms of the 
number of schools that must be managed by a CMO (Miron et al, 2010). 

Our study focuses on nonprofit CMOs that directly control four or more schools. More 
specifically, to be eligible for the study, an organization had to satisfy four criteria: it had to (1) have 
four schools open by fall 2007, (2) be nonprofit since inception, (3) not primarily serve dropouts or 
similar special populations, and (4) directly manage schools (that is, hire and fire school principals). 
This definition excludes for-profit organizations, charter networks that lack direct operational 
authority, and organizations that provide only back-office services. 

When the study began, we identified 40 CMOs containing 292 schools that satisfy the four 
criteria. As discussed in Chapter II, these CMOs schools are located in 14 states, with the largest 
concentrations in Texas, California, Arizona, Ohio, Illinois, New York, and the District of 
Columbia. 

2. Study Data Sources 

To examine eligible CMOs and address the research questions, the study makes use of the 
following data sources: 

• Survey of CMO Central Office Staff. Conducted in 2009, this web-based survey was 
completed by managers of 37 CMOs. The survey included questions on the role of 
CMO staff in managing schools, the number of schools operated by the CMO, CMO 
growth goals, the composition of CMO and school staff, educational priorities, teacher 
compensation and evaluation, and staff development, student behavior, and evaluation 
policies. 
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• Survey of CMO and District Principals. The sample of this web-based survey, 
conducted in 2010, comprised aU the principals of 292 CMO schools open by fall 2007 
and an equal number of comparison principals in traditional public schools. The 
comparison schools were the closest district schools with a similar mix of students. 
Approximately 70 percent of the principals responded to this survey. The survey 
included questions on the school’s educational priorities, curriculum, behavioral policies, 
staff development and evaluation, and student assessments as well as on the role of the 
CMO or district central office in managing and supporting the school. 

• Survey of CMO Teachers. The sample for this web-based survey, conducted in 2010, 
comprised randomly selected teachers in each of the CMO schools. We selected two 
teachers per grade for each elementary school and one English and math teacher per 
grade for each middle and high school. Approximately 76 percent of the teachers 
responded to the survey. The survey included questions on participation in staff 
development and curriculum planning activities, observation and evaluation of the 
teacher, job satisfaction and perceptions of the work environment, and the teacher’s 
training and background. 

• Student School Records. For the impact analysis and description of student 
characteristics, we collected administrative records from states, districts, and CMOs. The 
key outcome variables collected were individual student test scores and, where available, 
high school graduation and postsecondary enrollment. We also requested various 
background and demographic characteristics including race, gender, and baseline 
Individual Education Program (lEP) and English language learner (ELL) status. 

• Site Visits. During 2009 the study team visited 10 CMOs and 20 CMO schools. We 
selected the CMOs for diversity in terms of their theory of action, size, and region of the 
country. The visits included semi-stmctured interviews with the senior leadership team 
of the CMOs, two principals, and selected teachers as well as brief classroom 
observations. The research team collected information on the CMO theory of action, 
structural choices, growth strategies and challenges, alignment of CMO and school 
priorities, interactions with authorizers and districts, and teachers’ classroom 
management approach. 

• CMO Financial Records and Business Plans. The team collected the Form 990 tax 
statements for each of the 40 eligible CMOs to estimate per student expenditures. For 17 
of the CMOs, researchers also examined business plans to secure more detail on CMO 
strategies and financial projections. 


C. This Report and Other Study Reports 

This report addresses each of the three sets of study questions. The description of CMO 
characteristics and practices draws primarily from the surveys and school records but also is 
informed by the site visits. The impact analysis focuses on the impact of CMO middle schools on 
math, reading, social studies, and science test scores; this analysis covers 22 of the 26 CMOs with a 
middle school. In addition to providing information on the distribution of impacts for these CMOs, 
we report the extent to which impacts are correlated with specific CMO practices and structures. A 
subsequent report will incorporate findings on high school impacts, including high school 
graduation and postsecondary enrollment. The study will also produce a report describing CMO 
promising practices in greater detail. 
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This report also draws upon the study’s interim report. In 2010 CRPE completed an interim 
study report which describes the practices, challenges, and strategies of CMOs (Lake et. al, 2010). 
Drawing on the survey of CMO headquarters staff and the site visits to 10 CMOs, the report 
described the diverse educational strategies employed by CMOs, the challenges they encounter as 
they expand, and ways they differ from, and are perceived by, school districts. 

The rest of this report is organized as follows: 

Chapter II: CMO Growth, Students, and Resources. This chapter reviews recent growth of 
CMOs, the geographic concentration of CMOs in specific states and districts, the mix of 
students served by CMOs, CMO central office staffing information, CMO student-instmctor 
ratios, and per pupil expenditures of CMOs. 

Chapter III: Practices of CMOs and Their Schools. Drawing on the three surveys of CMO 
headquarter staff, principals, and teachers, this chapter describes the practices and policies of 
CMOs and their schools and the extent to which they differ from district practices and policies, 
and ways in which CMOs bundle multiple practices. It also examines the relationship between 
CMO practices and school-level instructional coherence and organizational health measures. 

Chapter IV: Impacts on Student Achievement. This chapter presents estimates of the 
impacts of individual CMOs on student achievement. We examine the range of positive and 
negative impacts as well as the overall average impact. In addition, the chapter examines the 
impacts for specific subgroups defined by race, ethnicity, and other background characteristics. 

Chapter V: Promising Practices. Combining data from surveys and impact results, this 
chapter examines which CMO practices may be positively associated with student impacts. 

Chapter VI: Questions for Future Research. This chapter identifies some policy-relevant 
questions that could be examined in the future. 
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II. CMO GROWTH, STUDENTS, AND RESOURCES 


Key Findings 

• CMOs account for approximately 17 percent of aU charter schools, with a 
higher percentage in urban areas. The CMO share of the charter sector has 
grown steadily over the past 15 years, although growth appears to have 
slowed recently. 

• Relative to their host districts, CMOs serve disproportionately large 
numbers of students who are black, Hispanic, and low-income, but 
somewhat fewer special education students and English-language learners. 

• Students entering CMO middle schools typically have prior achievement 
levels that are similar to the local average and somewhat higher than the 
local average for black and Hispanic students. 

• CMOs vary widely in per pupil spending, which reflects, in part, variation in 
state and district funding. 

• CMO schools tend to be smaller than schools in their host districts, with 
marginally lower student- teacher ratios. 


A. Introduction 

CMOs operate in the context of state and local laws, local economies, and funders’ priorities. 
CMOs can choose where they locate based in part on whether state and charter laws accommodate 
nonprofit organizations seeking to create networks of effective charter schools. In making location 
decisions, CMOs also need to consider whether they can reach their target population of students. 
Thus the location of CMOs can influence and be influenced by the mix of students they serve. In 
addition, CMOs are funded by federal, state, and local education agencies as well as philanthropies. 
The amount of funding they attract affects the resources they can deploy in their central offices, 
schools, and classrooms. Examining variations among CMOs along these and other dimensions can 
shed light on the role and characteristics of CMOs as well as the findings presented in other chapters 
of this report. 

B. Number and Geographic Distribution of CMOs 

This section presents nationwide data on the overall CMO landscape, including the number of 
CMOs nationally, where they are located, how fast they are growing, and how their growth trajectory 
compares to charter schools that are not operated by CMOs (as a sector). For those questions, we 
focus on the broadest population of CMOs, employing the definition of CMOs identified by the 
National Alliance for Public Charter Schools (NAPCS) and by the study team’s additional 
investigation (Miron et al. 2010). This all-inclusive identification of CMOs includes approximately 
130 nonprofit organizations that manage more than one charter school. Note that other sections of 
this chapter and the rest of this report employ a narrower definition of CMOs that includes only 
organizations managing four charter schools as of fall 2007 and excluding organizations that had 
once operated as for-profit organizations, serve only dropouts, or that do not have direct operational 
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control over their schools. We contrast the broader set of 130 CMOs with the 40 CMOs that satisfy 
the narrower definition of CMO used in our study. For some purposes we rely on data for students 
entering CMO middle schools, for which we are able to systematically compare the baseline (i.e., 
pre-CMO entry) characteristics of students with those of students in the same grades and districts. 
This is useful, for example, in assessing the prior achievement levels of students entering CMOs. 

1. CMOs Represent A Growing Presence in The Charter Landscape 

Using the broadest definition of CMOs, there were more than 130 CMOs operating 816 
schools as of the 2009-2010 school year.' Nationwide, these CMOs served nearly 250,000 students, 
representing approximately 15 percent of all charter school students and 0.5 percent of total public 
school enrollment. CMO-run schools constitute approximately 20 percent of the nearly 5,000 
charter schools operating nationally in 2009-2010, compared to just 13 percent in 2000 (see 
Figure II.l). 

The number of new CMO schools has grown rapidly alongside the number of CMOs. In 1993, 
the broad universe of CMOs accounted for just 5 percent of all new charter schools (see 
Figure II. 2). At their peak growth in 2007, CMO-mn schools represented 25 percent of all new 
charter schools. The rapid growth in CMOs and affiliated schools partly reflected the substantial 
philanthropic investments in CMOs. These donations totaled more than $500 million since 2000 
(Lake et al., 2010). 



Source: Analysis of data from the National Alliance for Public Charter Schools. 

Note: The Non- CMO schools category includes both standalone charter schools and for-profit 

management organizations operating charter schools. 


'Again, of the 130 organizations satisfying this broad definition of CMO (non-profits operating more than one 
charter school), 40 satisfied the specific criteria for inclusion in this study^. 
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Figure 11.2. Share of New Charter Schools That Were CMO- Operated, by Year, 1 993- 2009 



Source: Analysis of data from the National Alliance for Public Charter Schools. 

By 2009, the rate of growth in CMO schools appears to have slowed. In that year, CMOs were 
responsible for only 10 percent of new charter schools. It is too early to say whether this represents 
a trend. Assuming CMO growth has slowed, there are numerous possible explanations including 
(but not limited to) reduced philanthropic investments, effort to slow growth to enhance quality, 
reluctance by authorizers to issue charters to CMOs, and lack of facilities. 

2. CMO Presence Is Concentrated in States with Growth-Friendly Charter Laws and in 

Several Urban Areas 

Nationally, charter school growth is concentrated in some states, such as California and 
Arizona. The growth of CMO schools is even more concentrated in many of the same states. As 
Figure II. 3 shows, 80 percent of all CMO-mn schools (again using the broadest definition of CMO) 
operate in Texas, California, Arizona, and Ohio compared to 40 percent of charter schools overall. 
More than half the states that have charter schools do not have any CMO-operated schools. 

An important aspect of the pohcy context that may influence where CMOs choose to open and 
operate schools is the amount of autonomy granted by state law. Greater autonomy gives CMOs 
more freedom to operate their schools in the manner they see fit, and makes it easier to expand the 
number of schools they operate. 

Indeed, most of the states attracting substantial numbers of CMOs provide at least a moderate 
amount of autonomy to charter schools (Table II. 1). Using indicators developed by the NAPCS, we 
developed an overall measure of charter autonomy for each state with a charter school law. The 
indicators measure the degree to which state law provides for: (1) fiscally and legally autonomous 
schools with independent charter school boards, (2) automatic exemptions from many state and 
district laws and regulations, (3) automatic collective bargaining exemptions, and (4) the ability to 
operate more than one campus under one charter contract and/ or board. These are factors that the 
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Figure 11.3. Distribution of CMO Home Offices and CMO- Operated Schools by State, 2009 
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Source: Analysis of data from the National Alliance for Public Charter Schools. 


Table 11.1. CMO Location by Amount of Autonomy Offered in State Law 


Autonomy 

Category 

'Autonomy 

Index’ 

Score 

Number of 
States in 
Category* 

States in Each 
Category* 

States with 
CMOS 

Number of CMOs 
Operating in All 
States in Each 
Category 

High 

12-16 

10 

AZ, CA, DC, DE, FL, ID, 
NH, OK, OR, UT 

AZ, CA, DC, FL, 
OR 

64 (in 5 states) 

Medium 

8-1 1 

18 

AR, CO, CT, GA, IL, LA, 
MA, Ml, MN, MO, NC, 

NJ, NM, NY, PA, SC, TN, 
TX 

CO, CT, GA, IL, 
LA, NJ, NY, PA, 
TX 

83 (in 9 states) 

Low 

0-7 

12 

AL, HI, lA, IN, KS, MD, 
NV, OH, Rl, VA, Wl, WY 

IN, OH, MD 

1 1 (in 3 states) 

None/no 
charter law 

n.a. 

1 1 

AL, KY, ME, Ml, MT, NE, 
ND, SD, VT, WA, WV 

0 

0 


Source: Analysis of data from National Alliance for Public Charter Schools 


n.a. = not applicable. 

CMO executives we interviewed said influence their location decisions.^ Ohio is one of the few 
states with low levels of autonomy attracting many CMOs. 


2 States were scored on a scale of 0 (low) to 4 (high) on each of these indicators, and we weighted the indicators 
equally to create an overall ‘autonomy index’ with possible scores ranging from 0 to 16. We then created four autonomy 
categories corresponding to four equal ranges of the index scores. We collapsed the two lowest autonomy categories into 
one as states in the lowest two ‘low autonomy’ categories have managed to attract fewer than a do 2 en CMOs. 



II. CMO Growth, Students, and ^sources 


CMO schools are also concentrated in urban areas in general and certain cities in particular. 
Seventy-four percent of the schools of the 40 CMOs eligible for our study are located in cities 
(defined as small, mid-sized, or large cities by U.S. Census). CMOs represent a large fraction of the 
charter schools in New Orleans, Newark, Los Angeles, Chicago, Oakland, New York City, the 
District of Columbia, Sacramento, and Houston (see Figure II.4.). In the cities most saturated with 
charter schools, CMOs have significant presence, typically representing 30 to 50 percent of all 
charter schools. 

This pattern may partly reflect CMOs’ effort to develop dense networks of schools that can be 
easily supported by CMO staff. Several CMO leaders reported that they have sought to restrict their 
growth to certain metropolitan areas in order to capture local economies of scale. These staff said 
they seek to support growth and maintain quality across their network through close monitoring and 
active support of their schools (Lake et al. 2010). 


Figure 11.4. CMOs’ Market Share in Larger Charter Markets 



Source: Charter statistics provided by National Alliance for Public Charter Schools; Non- CMO charter 

data collected by Center on Reinventing Public Education from state and charter agencies. 
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3. Relative to the Urban Districts in Which Most CMOS Are Situated and Their Own 

Growth Goals, Most CMOS Are Small Organizations 

Some CMOs have encountered challenges when they sought to expand. These challenges 
include recmiting staff with the skiUs needed to implement or adapt the CMO’s educational 
approach, maintaining consistent quality across many schools, and avoiding rigid bureaucracy. These 
issues are discussed in the study’s interim report (Lake et al. 2010), which noted that the types of 
challenges CMOs encounter can change as it grows. 

Most CMOs oversee a modest number of schools relative to the districts in which they operate. 
Among the CMOs eligible for our study (all of which manage at least 4 schools), about 57 percent 
manage 10 or fewer schools (Figure II. 5); in the broader universe of CMOs (those managing two or 
more schools), about 87 percent of CMOs manage 10 or fewer schools (Figures II. 6). On one hand, 
the average CMO in our study is larger than 87 percent of aU school districts in the country in terms 
of number of schools. However, compared to the urban school districts where they are typically 
located, they are much smaller, with the average urban district overseeing 32 schools.^ Moreover, 
staff at most of the CMOs in our study say they would like to grow, hoping to add up to 20 schools 
each by 2025 (Lake et al. 2010). By those standards, most CMOs should still be considered in their 
early development stage, with the average CMO just entering the size range (beyond 8-10 schools) 
that can cause significant expansion challenges (Lake et al. 2010). 

Figure 11.5. Number of Schools Managed by CMOs Eligible for This Study, 2009 


■ 4-6 schools 

■ 7-10 schools 

■ 11-15 schools 

■ 16-25 schools 



Source: CMO Central Office Staff Survey. 


3 We define an urban district as any district that is classified as either a small, mid-sized, or large city by the U.S. 
Census and serves four or more schools. 
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Figure 11.6. Number of Schools Managed by CMOs in Broader Universe, 2009 


2 % 



■ 2-3 schools 

■ 4-6 schools 

■ 7-10 schools 

■ 11-15 schools 

■ 16-25 schools 

■ 26 - 1 - schools 


Source; Analysis of data from National Alliance for Public Charter Schools. 

The growth trajectory of individual CMOs varies dramatically according to their mission and 
capacity. Some pursue aggressive growth, aiming to open 3-5 new schools per year. Others aim for 
much slower expansion. On average, CMOs in our study are opening no more than one new school 
per year for the first six years (see Figure II. 7). After seven years of operation, the average pace picks 
up to approximately two new schools per year. This pace of expansion is faster than that of the 
broader universe. By Year 10, the average CMO in our study had approximately 13 schools, while 
the CMOs in the broader universe had approximately 6.'* 

C. Students Served 

In this section we rely on data for students entering middle schools in 23 CMOs for which we 
were able to acquire student-level data (including all of the CMOs included in the middle-school 
achievement analyses described in Chapter IV). Using information on students prior to CMO entry 
allows us to examine baseline achievement levels of CMO entrants (which would be impossible for 
students entering CMO schools in kindergarten), and it provides more confidence that data on 
classifications related to poverty and special needs were consistently collected (rather than potentially 
being biased by differences in data collection processes of conventional public schools and CMO 
schools). 


'' Approximately 54 percent of CMOs in our study and 58 percent of the broader universe have been operating for 
10 or more years. 
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Figure 11.7. Number of Schools Per CMO, by Year of Operation, for Study CMOs and Broader Universe 



Source: CMO Central Office Staff Survey; broader CMO “universe” data from National Alliance for 

Public Charter Schools. 

1. CMO Schools Serve a Greater Share of Minority and Low-Income Students Than Do 
Their Districts of Residence, But Fewer Students with Special Needs or Limited 
English Proficiency 

The decision by several CMOs to operate in large urban areas reflects not only an interest in 
achieving local economies of scale but also their mission to serve disadvantaged students. AH but 
two CMOs in our sample serve mostly Black and Hispanic (“minority”) students (as Figure II. 8 
shows). As indicated in the figure, over half of sample CMOs at the middle school level serve nearly 
all Black or Hispanic students, while others serve a mix of both groups, and in some cases other 
students, such as Caucasian and Asian students. (High school-level results are similar.) 

Even when compared to their host districts (which tend to be urban and high-minority), the 
student population served by the average CMO middle school in our study includes a greater 
percentage of minority students (see Figure II.9). This may reflect CMOs’ efforts to target minority 
families and communities within their host districts. In the average CMO, approximately 91 percent 
of (middle-school) students are minorities compared to 76 percent of their host districts’ students in 
the same grades (Table II.2). In Figure II. 9 the CMOs with bars above the x-axis (horizontal line) 
have a greater proportion of minority students than their host districts. 
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Figure 11.8. CMO Middle Schools Primarily Serve Black and Hispanic Students 
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Source; State and district school records. 

Figure 11.9: CMO Middle Schools Serve a Greater Percentage of Minority Students Than Their Districts 
of Residence 



CMOS 


Source: State and district school records 

Note: Blue bars represent statistically significant differences at the S% level 


13 



II. CMO Growth, Students, and ^sources 


Table 11.2. Student Demographics in CMO Middle Schools and Host Districts (Same Grades) 



Percentage 

Minority 

Percentage Free 
and Reduced- 
Price Lunch 

Percentage 
Limited English 
Proficient 

Percentage with 
lEP 

CMO Average 

91% 

71% 

14% 

9% 

CMO Host District Average 

76% 

64% 

19% 

13% 


Source: CMO and district numbers are from state and district school records; these are unweighted 

simple averages (rather than student- weighted averages). 

NA = not available. 

CMOs also appear to focus on serving low-income students, although relevant data are only 
available for 11 of these CMOs. For the average CMO, approximately 71 percent of students 
entering middle school qualify for free and reduced-price lunch, compared to 64 percent of students 
served by their host districts in the same grades (see Figure 11.10). Eight of the CMOs tend to serve 
significantly more low-income students than their corresponding districts, while only two serve 
fewer low-income students. 


Figure 11.10. CMO Middle Schools Serve Greater Percentage of Students Who Qualify for Free and 
Reduced- Price Lunch than Host Districts 



Source: State and district school records. 

Note: Blue bars represent statistically significant differences at the 5 percent level. 

FRPL = free or reduced- price lunch. 


14 


II. CMO Growth, Students, and ^sources 


On the other hand, special education students and English-language learners appear to be less 
likely to enroll in CMO schools (see Figures II.ll, 11.12). On average, nine percent of the students 
entering these CMOs are students with disabilities that qualify them for Individualized Education 
Plans (lEPs), compared to 13 percent of host district students (Figure II.ll). Similarly, students with 
limited English proficiency (LEP) appear to be less likely to enroll in CMOs (Figure 11.12). It is 
possible, however, that these differences are over-stated because of differences in the way schools 
categorize special needs and LEP status.^ 

Figure 11.1 1. Most CMOs Serve Fewer Special Education Students than Host Districts 


a 




CMOs with Middle Schools 


(/) 

> 

o 

o 

"5 oi 

fcO "C 
" 2 
-a 

= I 

o ^ 

U « 
■E § 

c u 

(U (/) 


“I 1 1 1 1“ 


“I 1 1“ 


...Yrrm 


llllllll 



III 


1 


Source: State and district school records. 

Note: Blue bars represent statistically significant differences at the 5 percent level. 


^^The data on lEP and LEP status are drawn from records of students prior to entering the CMO school and for all 
students in the same grades in district schools. Most of the CMOs draw somewhat more students from charter feeder 
schools than do regular district schools in their host districts. If charter schools systematically under-identify special 
needs and limited English abilities, then the rates of special needs and LEP status may not be fuUy comparable for the 
CMOs and districts. 
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Figure 11.12. Most CMOs Serve Fewer Limited English Proficient Students than Host Districts 



Source: State and district school records. 

Note: Blue bars represent statistically significant differences at the 5 percent level. 

2. Students Entering CMO (Middle) Schools Typically Have Prior Achievement Levels 
That Are Similar to the Local Average and Somewhat Higher Than the Local Average 
For Black And Hispanic Students 

We also examined students’ academic performance before enrolling in either a CMO middle 
school. The majority of CMOs — 18 of 23 — enroU students whose prior achievement levels are 
reasonably similar to (within a quarter of a standard deviation) those of other students in their 
districts. Two CMOs enroll students with baseline test scores that are more than 0.25 below those of 
their districts, while three CMOs enroll students with baseline test scores that are more than 0.25 
above those of their districts (see Figure 11.13). 
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Figure 11.1 3. Incoming Reading and Math Scores for CMO Students and Their District Peers 



Source: State and district school records. 

Dark blue and red bars represent statistically significant differences between CMOs and their host districts 
at the .05 levels, two- tailed tests. 

However, most CMOs attract somewhat higher achieving students of color relative to those 
served by their host districts. Thirteen of 22 CMOs in our sample serve black students who had 
significandy higher average pre-entry reading test scores than the averages for their black peers in 
the host district; only two CMOs served black students with scores significandy lower than those of 
black students locally. Likewise, the pre-entry reading scores of Hispanic students in 13 of 23 CMOs 
were significantly higher than Hispanic averages locally, and only three CMOs served Hispanic 
students with significandy lower baseline reading achievement than that of other Hispanic students 
in their districts.'^ The percentages are similar for reading test scores. Thus, while CMOs attract a 
disproportionate number of black and Hispanic students, these students tend to have higher test 
scores on average when they enter the CMO than their black and Hispanic peers in the host 
districts. 


The number of CMOs examined differs for black students and Hispanic students because one CMO has no black 
students. 
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3. Like Many Independent Charter Schools, Many CMO Schools Offer Grade 
Configurations, That Differ from Those of District Schools, Often Spanning More 
Grades Than Those in Conventional Public Schools 

CMOs have the abiUty to choose which grade levels they wish to serve overall and in a single 
school. Many use that flexibility to develop schools that have a broader range of grades than is the 
case among most district schools. As Figure 11.14 shows, K-8 schools are fairly common in CMOs, 
as are other configurations that span traditional middle and high-school grades or even elementary, 
middle, and high-school grades. Not captured in the flgure is the fact that CMO middle schools are 
also often configured unconventionally, serving grades 5-8 instead of 6-8 or 7-8. 

Approximately one-third of CMOs appear to specialize in schools with a single grade 
configuration (e.g., offering only middle schools or only K-8 schools). The rest combine more than 
one category, often in feeder patterns (e.g., an elementary CMO school near a middle school). 

Figure 11.14. Percentage of CMO Schools Serving Various Grade Level Categories 

30% n 


(/) 



Elementary Middle High K - 8 Other 

Grade Category 


Source: Data assembled by study team from CMOs 

Note: Elementary includes all schools that serve only students in grades K- 6, middle includes all 

schools that serve only students in grades 5- 8, high includes all schools that serve only 
students in grades 9-12, K-8 includes all schools that span grades K-8 but do not include 
high- school grades, and other includes all schools that serve any other mix of grades, 
spanning middle and high or elementary, middle, and high. 
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D. Resource Use 

1. Per-Student Spending Reflects State Charter Funding Patterns 

As with other charter schools and school districts, CMOs are funded through a combination of 
public funding and charitable donations or grants. Expenditures for the 39 study CMOs for which 
data was available ranged from $5,000 to $20,000 per pupil (see Figure 11.15). This variation appears 
to be driven in part by the per-pupn funding amounts determined by public funding formulas in 
each state’s charter law (Figure 11.16). The correlation between per pupil CMO expenditures and per 
pupil state funding of charter schools is 0.61. 

In addition to state funding, some CMOs receive philanthropy and other funding. At least 9 of 
these CMOs spend more than $1,000 per pupil beyond the amounts allocated from public sources 
and four CMOs spend more than $4,000 per pupil more. Because public per-pupn revenue 
allocations to charter schools are often lower than those granted to traditional public schools, it 
remains unclear whether the CMOs’ per-pupil spending is larger or smaller than that of nearby 
district schools. 

2. Most Central Office Staff Devoted to Educational Support, Operations, and Finance 

CMOs look very much like school districts, both in organizational structure and functions 
served. Central office staff provide supports, services, and oversight for the schools they manage. 
Among the 37 CMOs responding to our survey of central office staff, the majority of CMO 
positions are directed at educational supports (such as professional development, coaching, 
assessment, and data analysis), operations (such as payroll and facilities management), and finance. 
Figure 11.17 shows how these CMOs allocate their central office staff by functional area, as reported 
by each CMO. Some CMOs invest heavily in large central offices, while others maintain a fairly 
minimal level of administrative staff. Decisions about how to allocate central staff appear to be more 
a function of CMO preference than a function of size, and CMOs vary widely in how they allocate 
their staff across categories. 

The overall size of the central office in relation to number of students served also varies widely. 
Although one might expect this ratio to drop as CMOs grow because of economies of scale, there is 
no significant relationship between size and staff-to-student ratio. This may be because some large 
CMOs attempt to provide more coaching, guidance, or other support to their schools. 
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Figure 11.1 5. Variation in Per- Pupil Expenditures by CMO 


CMO Per- pupil Expenditures 



Source: IRS 990 forms for tax year 2009. 


Figure 11.16. CMO Expenditures Compared to Per- Pupil Public Funding Revenues 
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Source: CMO expenditure data from IRS 990 forms for tax year 2009, per- pupil revenue data from Ball 

State University, Charter School Funding: Inequity Persists. 

PPE = per- pupil expenditures; PPR = per- pupil revenue. 
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Figure 11.1 7. Average Distribution of CMO Staff, by Functional Category 


Majority of FTEs Directed at Educational Supports and Operations 



Percent of FTEs in Various Functional Areas 


■ Development/Fundraising 

■ Technology 

■ Marketing/Enrollment 

■ School data colleaion and 
analysis 

■ HR 

■ Other 

■ Finance/Accounting 

■ Operations 

■ Educational supports 


Source: CMO Central Office Staff Survey 

FTE = fuli- time equivaient. 

3. CMO Schools Tend To Be Smaller Than Schools in Their Host Districts, with 

Marginally Lower Student-Teacher Ratios. 

CMOs are generally free to restrict their school and classroom sizes as they choose. Because 
their organizational mission often focuses on personalized learning and a strong, intimate school 
culture, one might expect CMOs to have smaller school and class sizes. 

Indeed, CMO schools are much smaller than nearby schools in their districts (Figure 11.18). The 
CMOs in our study average 389 students per school compared to 982 students per school in nearby 
district schools. CMO schools have approximately the same number of grades per school as their 
district counterparts so the difference in total enrollment is largely due to a smaller number of 
students per grade. 

Class sizes and student-to-teacher ratios are also somewhat smaller in CMO-run schools than in 
their host districts, although the average differences are not as great as the school size differences. 
The average pupil-to-instructor ratio in math and reading are about 20.9 students; by contrast in 
comparison schools this ratio is 23.5 in math and 23.2 in reading. These CMO-district differences 
are statistically significant in math but not in reading. 
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Figure 11.18. Enrollment Per School in CMOs Compared to Nearby District Schools 
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III. CMO PRACTICES AND SCHOOL OUTCOMES 


Key Findings 

• Relative to district principals, CMO principals report that their schools provide more 
instructional time, are more likely to have a comprehensive school-wide behavior 
policy, practice more frequent teacher coaching and monitoring, and place more 
emphasis on performance-based compensation 

• According to principals, CMOs are less likely than districts to prescribe a particular 
curriculum, behavior pohcy, or staff evaluation approach; however CMO principals are 
more likely to report implementing a school-wide behavioral strategy than their district 
counterparts. 

• Different groups of CMOs tend to adopt distinct sets of practices: those emphasizing 
behavior policies also tend to have longer instructional time while those emphasizing 
performance-based compensation tend to make frequent use of formative assessments; 
both of these groups provide intensive teacher coaching 


A. Introduction 

In this chapter, we describe the policies and practices of CMOs and CMO schools. Any effects 
that CMOs have on student achievement presumably occur through their influence on their schools 
and on students’ educational experiences. Our study is configured to identify some of the ways that 
CMO school practices and policies differ from those of nearby district schools. We also explore the 
differences among CMOs in their central office-level and school-level practices. Drawing upon 
CMO central office, principal, and teacher surveys, this chapter addresses the following questions: 

1. How do CMO school policies and practices differ from district schools? What are the 
key elements that distinguish CMO instructional systems, human capital strategies, and 
their approach to school culture and parental involvement? 

2. To what extent do CMOs centralize decisions about school-level practices? Are CMOs 
more or less prescriptive than districts? How does prescriptiveness vary across CMOs? 

To what extent are schools within CMOs consistent in their implementation of specific 
practices? 

3. How much variation is there in CMO practices? Is it possible to identify types of CMOs 
that place more or less emphasis on specific sets of practices? 

4. To what extent do CMO practices appear to influence the instructional coherence and 
organizational health of their schools? 

Any differences in practices among CMOs (relative to nearby district schools) could potentially 
contribute to differences in their impacts on student achievement. Before estimating CMOs’ 
achievement impacts, we identified a number of practices and structures that might be associated 
with impacts. These practices were selected using the following three criteria. First, we attempted to 
identify characteristics that CMOs or researchers believe is a promising practice or structure that is 
important to improving achievement. Second, using our surveys, we looked for practices where 
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there is substantial variation across CMOs in the characteristic and, most importantly, in the 
differences between each CMO and its comparison schools. Third, using the principal survey, we 
sought to identify practices that distinguish CMOs schools from nearby district-operated schools. 

The practices included among our seven “primary hypotheses” are the focus of our descriptive 
analyses in this chapter. We also describe some additional practices that distinguish CMO schools 
from district schools and could impact student outcomes. The seven primary hypotheses focus on 
whether impacts are associated with the following seven CMO characteristics: 

1 . Amount of instructional time 

2. Consistent educational approach, including curriculum and instructional materials 

3. Student behavior policies that include specific rewards, sanctions, and commitments 

4. Intensive teacher coaching and monitoring 

5. Performance-based teacher evaluation and compensation 

6. Frequent review and analysis of student formative assessment data 

7. Number of CMO schools 

The first six characteristics are practices that are discussed in this chapter; Chapter V examines 
how these practices relate to impacts on student achievement. Chapter II contains information on 
the number of schools in each CMO and Chapter IV examines how this variable relates to impacts. 

We also hypothesized that CMO practices may have an effect on student achievement indirecdy 
via two mediating factors: (1) instructional coherence — the degree to which different aspects of the 
school’s instructional program reinforce one another, and (2) organizational health — the degree to 
which the school is efficient and effective in maintaining a stable environment with few 
administrative problems. Instmctional coherence might be impacted by whether a CMO has a 
consistent educational approach, for example. Organizational health may be influenced by factors 
such as centralization of decisions at the CMO level or the CMO’s size. 

Methods 

Several of our measures of CMO practices are composite variables. These composites were 
created by combining closely related survey items into a single measure, reducing measurement error 
and capturing the breadth of a constmct. See Appendix A for details on how these composites were 
created. 

To measure the CMO characteristics described in this chapter, we chose to rely primarily on the 
principal survey, rather than on our central office staff survey or teacher survey, for several reasons. 
First, relative to the central office survey, the principal survey reflects the perceptions of 
respondents who are closer to the implementation of CMO practices in schools. Second, the 
principal survey provided a unique opportunity to contrast practices in CMO schools with those in 
nearby district schools. Each CMO principal eligible for the survey was matched with the principal 
of the district school with the same grade range in closest geographic proximity, and we attempted 
to survey all matched comparison principals in addition to all CMO principals. 
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Our unit of analysis is the CMO.’ Some of our measures incorporate survey questions that ask 
about policies and activities of the CMO central office. However, we also rely on principal survey 
responses about school activities to infer overall CMO-level characteristics, averaging principal 
responses within each CMO and within the group of comparison schools associated with that CMO 
using weights to adjust for nonresponse.^ There is substantial consistency within CMOs in these 
practices relative to the variation between CMOs (see Appendix A.). 

We acknowledge that our measures of CMO practices rely on principal reports of CMO 
practices and not on direct observation of these practices in CMO schools. Given that the surveys 
were conducted for a study of CMOs, it is possible, for example, that CMO principals tended to 
report school-level practices in the most positive light. We examined correlations between similar 
teacher and principal survey responses and confirmed that all correlations were positive and, on 
most individual items, substantial. In addition, when measuring school-level intermediate outcomes 
in the domains of instructional coherence and organizational health, we used the teacher survey 
exclusively for all measures of instructional coherence and for a measure of teacher satisfaction in 
the organizational health domain. 

In this chapter, we begin by describing average differences in CMO and district school practices 
in the following areas, in turn: instructional time, educational approach, student behavior strategies, 
and teacher effectiveness. We also describe the variation across CMOs in the emphasis on these 
practices in their schools. Next, we use cluster analysis to explore whether CMOs can be classified 
into groups based on their management strategies and bundling of key practices. Finally, we describe 
associations between the intermediate school outcomes in instructional coherence and 
organizational health and the school practices identified in the first part of the chapter. 

B. CMO and School Practices: Significant Differences from District Schools 

CMO and district schools appear to differ in many potentially important respects. Based on the 
principal surveys, it appears that CMO and district schools are appreciably different in terms of their 
instructional time, educational approach, student behavior policies, compensation and evaluation 
policies, and strategies for teacher coaching and monitoring. Although we focus on the noteworthy 
differences here, there were also areas of similarity. For example, the frequency with which CMO 
and district school and central office staff review and analyze student assessment data — one of our 
primary hypotheses — was not significantly different (Table III.l). 


* The unit of analysis for the comparison sample corresponds to the group of district schools matched with the 
schools within a CMO. Therefore, for CMOs with schools located in more than one district, this comparison unit 
aggregates responses from multiple districts, weighted by the number of CMO schools located in each district. 

2 Of 292 CMO principals eligible for the study, 76 percent responded to the survey. Among the 292 matched 
comparison principals, the response rate was 59 percent. Five CMOs eligible for the study declined to participate in the 
survey. We developed weights to adjust for principal nonresponse. 
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Table III.1. CMO Schools and District Schools Diverge on Most Primary Hypotheses Practices 


Primary Practice 

Significant CMO- District Difference? 

Consistent educational approach, including 
curriculum and instructional materials 

V 

Student behavior policies that include specific 
rewards, sanctions, and commitments 

V 

Frequent review and analysis of student formative 
assessment data 


Intensive teacher coaching and monitoring 

V 

Performance- based teacher evaluation and 
compensation 

V 

Amount of instructional time 

V 


1. On Average CMOs Offer More Instructional Time 

Recent literature suggests that extended learning time may be a key component distinguishing 
more successful schools.^ Expanding learning time is a strategy implemented by some CMOs. We 
explored the variation in instructional time in CMOs relative to districts and examined differences 
after decomposing instructional time into hours per day and days per year. 

Relative to districts, CMO schools tend to require significantly more time in school both during 
and outside of the academic year. On average, CMOs provide 1,373 hours of instruction per year 
compared to 1,239 for districts, a difference of 134 hours (Table III.2). The difference between 
CMO and district schools is driven more by extending the length of the school day than by 
extending the length of the school year. CMO schools have significantly longer school days. The 
average length of a CMO school day, as reported by CMO principals, is 7.5 hours, while the mean 
district comparison school reported a school day of 6.9 hours. CMO schools and district schools 
report similar numbers of instructional days per year, with a mean of 182 at the CMO level and a 
mean of 1 80 at the district comparison level. 


Table III.2. CMO and District School Instructional Time Practices^ 


Practice 

n 

CMO Mean 

Comparison Mean 

p- value 

Yearly Instructional Hours 

36 

1,373 

1,239 

p<.001 



(185) 

(98) 


Required Summer Session Prior to 

36 

0.1 7 

0.05 

p=.02 

Enrollment, Proportion 


(0.27) 

(0.11) 



Source: Principal Survey. 


3 Angrist et al. 2011. 

p=0.0001 

5 In this table and throughout this chapter, the significance of the CMO-comparison difference was tested using a 
two-tailed t-test with unequal variance assumption. P-values smaller than 0.05 were considered significant. 
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Although the number of days per year in CMO and district schools is similar, CMO principals 
were more likely to report the use of a mandatory summer session for new students prior to 
enrollment. An average of 17 percent of principals in a CMO report this requirement, relative to 5 
percent among district schools (p=0.02). 


Although CMO principals report more instructional time than district principals on average, the 
mean masks the substantial variation across CMOs in the amount of time students are required to be 
at school. Approximately one-fifth of the CMOs require an average of more than 1,500 hours 
annually, while 36 percent require 1,300 hours or fewer. As Figure IIl.l indicates, there is 
substantially more diversity in instructional time requirements among CMO schools than among 
district schools. Despite the significant variation across CMOs in instmctional time,'’ standards 
appear to be very consistent within CMOs." 


Figure IIl.l. CMO Students Spend More Time in School 



Source: Principal Survey. 


2. CMO Principals Report More Autonomy Choosing Their Curriculum 

Independent charter schools are often established in order to provide more flexibility in 
defining an educational approach. As collections of charter schools, CMOs could in theory either 
provide principals with substantial discretion in selecting curricula and instmctional materials or 
require all schools to adopt the same curriculum. We investigated the extent to which CMO 
principals perceive flexibility in their curricular choices, whether they report more or less autonomy 
relative to district principals, and how these practices vary within and across CMOs. 


A one-way analysis of variance (AN OVA) confirms that the CMO means are not identical, and the average of the 
CMO means in the top half is significantly larger than that of CMOs in the bottom half (p<.0001). 

^ The intraclass correlation coefficient for yearly instructional time is 0.64, indicating that substantial variance 
among CMO schools is explained by clustering at the CMO level rather than by school-level practice. 
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Relative to districts, CMOs are less likely to mandate a specific curriculum or instmctional 
materials and more frequently allow their schools to make such choices independently. On a 
composite measure of central office-level centralization of educational approach, CMO schools 
reported significantly less centralization than district schools, on average. This difference was driven 
primarily by of the extent of central office prescriptiveness in selecting instmctional materials. As 
indicated in Figure 111.2, significandy fewer CMO principals report that their central office plays a 
role in selecting some books or instructional materials than comparison principals. CMO schools are 
also less likely than district schools to report that their central office is responsible for actually 
choosing their curricula or instmctional materials. 


Figure III.2. CMO Schools Report More Autonomy in Choosing Instructional Materials 


u 


Ol 

C 


Q-r 


O 
CL 

P 


u 

o _ o 

5^ 1/1 
TO •— 

b 

Qj Cl 

^ E 

QJ O 

Q- u 

C s- 
n3 O 
a; 


90% 

80% 

70% 

60% 

50% 

40% 

30% 

20% 

10% 

0 % 



Central office provides support Central office chooses 

selecting some books or instructional curriculum/instructional materials** 
materials** 


■ CMO 

■ District 


Source; Principal Survey. 


**Statistically significant difference; p<. 01 . 


The overall extent to which CMOs centralize decisions on curriculum and materials varies 
significantly across CMOs but appears to be relatively consistent within CMOs.^ CMO practice 
varies most substantially with regard to whether the CMO central office plays any role in helping 
select some books or instructional materials: in one-third of CMOs, most^ principals report no 
central office involvement, while just over one-fifth of CMOs have very mixed principal responses, 
and the remaining CMOs are reportedly more consistently involved with their schools on this 
dimension. On the other hand, most CMOs do not expUcitly select curricula for their schools, and 
principal responses indicate consistency in the policy within CMOs. Figure 111.3 illustrates that, in a 
majority of CMOs, most principals report that they have flexibility in choosing curricula or 
instructional materials; that is, the central office does not mandate a specific curriculum. 


® The intraclass correlation coefficient for the composite measure of centralization of educational approach is 0.51. 
® Two-thirds or more of principals. 
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Figure III. 3. Most CMO Principals Report Consistency in Decentralization of Curriculum 

Distribution of CMOs by Within- CMO Consistency of School Responses 


Fraction of Schools within 
CMO Reporting that Central 
Office Selects Curriculum 

■ 1/3 or fewer 

■ Between 1/3 and 2/3 

■ 2/3 or more 


Source: Principal Survey. 

3. CMO Schools Report Comprehensive Behavior Policies, but Flexibility in Defining the 

Details 

Independent charter schools often have a fair amount of autonomy in defining behavior 
policies relative to district schools, but there has been relatively litde information on how CMOs 
influence their schools’ behavior policies. CMOs can potentially influence the behavior policies of 
their schools in two ways. The first is the extent to which they encourage their principals to develop 
and enforce some type of comprehensive behavior code. The second is in whether they give 
principals flexibility in defining the details of that behavior policy, allowing different schools within 
the CMO to adopt different policies. We first examine school-level practices in developing a 
comprehensive behavior policy and then turn to CMO-level prescriptiveness, as reported by 
principals, in mandating specific disciplinary and behavior policies across their schools. 

CMO principals were more likely than district principals to report that their schools define and 
enforce a comprehensive set of behavioral standards and require responsibility agreements. We 
found significant differences between CMO school practices and district school practices, on 
average, on a composite measure of comprehensive behavior policy that includes, among other 
items, the use of rewards and sanctions and the inclusion of a zero tolerance pohcy (p<.01). The 
difference between CMO and district school practice appears to be driven by two components 
(Figure III. 4). First, significandy more CMO schools implement consistent school- wide behavior 
codes: on average, 95 percent of principals within a CMO report that behavioral standards and 
discipline policies are established and enforced consistendy across the entire school. Second, CMO 
schools are significandy more likely to require parents or students to sign an agreement describing 
their responsibilities. 
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Figure III.4. CMO Schools Emphasize Behavior Standards and Responsibility Agreements 
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**Statistically significant difference; p<.01 . 

That said, CMOs appear to be less Ukely than districts to require aU their principals to adopt a 
uniform behavior strategy. Rather, CMO principals report more flexibility in defining the details of 
behavior policies than do district principals. On a composite measure of the extent to which central 
office staff provide or mandate disciplinary policies and behavior codes, CMOs averaged 
significantly lower levels of centralization (p=.007), as illustrated with a specific survey item response 
in Figure III. 5. 


Figure III. 5. More Flexibility for CMO Schools in Defining Behavior/Disciplinary Policies 
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**Statistically significant difference; p<0 .01 . 
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Although CMOs are less likely than districts to mandate a specific behavior policy, CMO central 
office staff visit schools more frequently than do district office staff, as reported by principals 
(Figure III. 6.). More frequent visitation, an average of 1 to 4 times per month, might permit them to 
influence or monitor school climate in ways other than prescribing specific practices. 


Figure III.6. CMO Schools Report More Frequent Central Office Staff Visits 
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Source: Principal Survey. 

Although CMOs vary significantly on the composite measure of school-level behavior poUcy, 
there is also substantial variation in the behavior policies across schools within the same CMO.'® 
This variation may be expected given the tendency of CMOs not to mandate specific components of 
behavior and disciplinary strategies. An example of this lack of consistency within CMOs is 
illustrated in Figure III. 7. Among more than 60 percent of CMOs in the sample, at least one but not 
all surveyed principals reported requiring students and/or parents to sign responsibility agreements. 

The survey data suggests that in CMOs with the strongest average emphasis on schoolwide 
behavior policies, central office staff are either encouraging or requiring schools to adopt specific 
practices. Using the CMO central office staff survey, we examined CMO reports of behavior policies 
among the subgroup of CMOs scoring at or above the median CMO score on the schoolwide 
comprehensive behavior policy index, which was based on principal reports. These CMOs were 
most likely to report that they require student uniforms (94 percent) and that they require students 
or parents to sign a contract or letter of commitment (71 percent). 


The intraclass correlation coefficient for our composite measure of comprehensive behavior policy is 0.30. 
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Figure III. 7. Substantial Within- CMO Variation in Behavior Requirements 


Distribution of CMOs by Within- CMO Consistency of School Responses 
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Source; Principal Survey. 

CMOs whose schools emphasize enforcing behavioral expectations also tend to require longer 
instructional time. Of the 18 CMOs with values at or above the CMO median on a composite of 
behavior policy consistency, 78 percent (14 CMOs) also place in the top half of CMOs when ranked 
by instmctional hours.” The CMOs may seek to emphasize discipline both by defining and 
enforcing behavior policies and by requiring students to spend a long time at school. 

4. CMO Schools Emphasize Targeted Recruitment and Performance-Based 

Compensation 

One of our core hypotheses focuses on the extent to which CMOs reward teachers based on 
their performance. More broadly, we were interested in examining how CMOs hire, evaluate, and 
compensate classroom teachers; whether these practices differ from those employed by districts; and 
whether these staffing strategies vary within and across CMOs. 

CMOs appear to strive to hire and reward school staff who seem to be committed to the CMO 
mission and effective in the classroom. Relative to district schools, CMO schools place a 
significandy higher priority on screening teacher applicants based on their performance leading a 
sample class (p=0.03) and their expressed commitment to the school mission (p=0.01). CMO 
schools also appear to draw from a larger applicant pool than do district schools, on average: CMO 
principals report that the number of applicants per vacancy is approximately 60 compared to 23 for 
district principals (p<.01).'^ 


A Fisher’s exact test confirms that the distribution of CMOs on our behavior policy measure is significantly 
related to the distribution of CMOs on our instructional time measure (p=0.0002). 

These differences between CMO schools and district schools in hiring practices and applicant pool could be 
driven in part by differences in their certification requirements. While we have no data for districts, the extent to which 
CMOs hire teachers from alternative certification programs, as measured by our CMO central office staff survey, varies 
substantially; the mean proportion of teachers from Teach For America or Teaching Fellows programs is 17 percent but 
ranges from 0 to 55 percent. 
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CMO schools are significandy more likely than district schools to employ a system of 
performance-based compensation for both teachers and principals, on average/^ In particular, based 
on the principal survey responses, we found CMO-district differences in four out of four practices. 
First, while an average of 69 percent of principals in a CMO report using student test scores to 
evaluate teachers, only 46 percent of principals in a comparable group of district schools report 
doing so (p<.01). Second, compared to district principals, CMO principals report placing more 
importance on student test scores and teacher observations and less importance on tenure and 
education in determining teacher pay (p<.01) (Figure III.8). Third, the emphasis on performance- 
based compensation extends to principals as well: CMO principals are significantly more likely than 
district principals to have an opportunity to earn a bonus for student achievement (p<.01). Finally, 
CMO teachers are less likely to have an opportunity to earn tenure. Among CMO schools, an 
average of only 7 percent of principals report offering tenure, while 67 percent of district principals 
do (p<.001). 


Figure III.8. Student Test Scores and Observations More Important to Teacher Pay in CMO Schools 
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Source: Principal Survey. 


Although CMO school-level practices indicate a consistently strong emphasis on performance- 
based compensation and evaluation, it appears that their schools have some latitude in shaping these 
policies. Nearly aU (99 percent) district comparison principals report that the district is responsible 
for setting teacher salaries, but significandy fewer CMO principals (75 percent, on average) report 
that the CMO central office does so (p<.01). Similarly, nearly all district principals (97 percent) 
indicated that the central office requires a specific protocol for evaluating teachers, but only 81 
percent of CMO principals reported that their central office had a comparable policy (p<.01). This 
flexibility appears to result in some variation within CMOs, but CMOs stiU account for a substantial 


On a composite measure of emphasis on performance-based compensation and evaluation, the average CMO 
value was significantly higher relative to the district comparison (p<.001). 
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portion of the variation across CMO schools.'"^ Further, despite some variation within CMOs on 
specific approaches, there is substantial consistency within and across CMOs on one particular 
component: the use of student test scores to evaluate teachers (Figure III. 9). 

CMOs emphasizing performance-based compensation and evaluation may be applying a 
broader strategy focused on the use of student data. Indeed, the CMOs that evaluate and 
compensate their teachers based on their performance also ask teachers and principals to review and 
analyze formative assessment data more frequently (correlation of .43, p<.01). In addition, these 
CMOs tend to be more likely to apply a centralized instructional model and materials (correlation of 
.29, p=.09). The size and significance of these correlations stand out relative to the correlations of an 
emphasis on performance-based compensation with other practices. 

5. Frequent Teacher Coaching and Monitoring Emphasized Over Workshops 

In addition to hiring, compensation, and evaluation strategies, schools may also aim to increase 
teacher effectiveness via professional development. Professional development can be offered in a 
variety of ways, including in-service workshops, graduate-level coursework, and peer coaching. We 
examined both the centralization of professional development in CMOs and the frequency of 
teacher coaching and monitoring, exploring CMO-disttict school differences, variation across 
CMOs, and whether intense teacher coaching correlates with other practices. 

Eifl ure III.9. Consistent Use of Student Test Scores to Evaluate Teachers within CMOs 

Distribution of CMOs by Within- CMO Consistency of School Responses 


Fraction of Schools Reporting 
that Student Test Scores 
Influence Teacher Pay 

■ 1/3 or fewer 

■ Between 1/3 and 2/3 

■ 2/3 or more 




Source; Principal Survey. 


The intraclass correlation coefficient for our composite measure of emphasis on performance-based 
compensation and evaluation is 0.40. 
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At the school level, CMOs emphasize a higher intensity of teacher coaching and monitoring 
relative to nearby district schools. On a composite measure of principal reports on the frequency of 
observations, review of lesson plans, and provision of feedback for teachers, CMOs score 
significantly higher than the district schools (p=0.01). We find marginally significant (p-values fall 
between 0.05 and 0.06) differences between CMO and district school practice on three areas in 
particular. On average, CMO principals reported that the following practices were implemented in 
their schools with higher frequency relative to comparison district school principals: (1) observation 
of new teachers by principals or other administrators (Figure III. 10), (2) providing feedback from 
observations to new teachers, and (3) requiring new teachers to submit lesson plans for review. 

CMOs are distributed along a spectrum in the extent to which they coach and monitor teachers. 
A one-way analysis of variance confirms that there is significant variation across CMOs in the degree 
of intensity of coaching and monitoring. Figure III. 11 provides one example of this variation. 
However, we also found substantial variation in focus on teacher coaching and monitoring at the 
school level that is not primarily explained by being part of a particular CMO.^^ 

Figure 111.10. More Frequent Observation of Teachers by Administrators in CMO Schools 
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Source: Principal Survey. 


*5 The intraclass correlation coefficient for our composite measure of frequency of teacher coaching and 
monitoring is 0.30. 
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Figure III.1 1. More Frequent Submission of Lesson Plans for Review in CMO Schools 



■ CMO 

■ District 


Mean Annual Frequency of Submission of Lesson Plans 
for Review 


Source: Principal Survey. 

CMOs are reportedly less likely than nearby districts to provide certain types of professional 
development support to schools. Specifically, CMO school principals report significantly less central 
office provision of professional development, such as workshops and in-school service programs, 
for teachers outside the classroom than do district school principals (p<.01). However, although 
CMOs do not explicitly provide this type of centralized professional development, they appear to 
seek to shape teacher practice in a different way — through intense teacher coaching and monitoring 
to develop like-minded staff. 

Our analysis of the associations among the practices that comprise our primary hypotheses 
indicates that CMOs that provide a substantial amount of coaching may fall into two categories. As 
shown in Table III.3, CMOs that emphasize teacher coaching are also more likely to encourage or 
require school staff to frequently review and analyze formative assessment data. We also find a 
strong and significant relationship between the intensity of teacher coaching and the number of 
instructional hours in the school year. 


We find a positive and marginally significant association between intensity of teacher coaching and monitoring 
and the provision of centralized professional development support (correlation: 0.29, p=.08), however, which indicates 
that the two approaches could be complementary. 
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Table III. 3. Intense Teacher Coaching Highly Correlated with Formative Assessment Use and 
Instructional Time Correlations between Teacher Coaching and Other Practices (Pearson Correlation 
Coefficient) 



Frequent Use of 


Instructional 


Formative 

Use of School- Wide 

Hours in School 


Assessments 

Behavior Policies 

Year 

Intense Teacher Coaching 

Q ^ 1 *** 

0.31* 

0.58*** 


Source: Principal survey. 

* p< .1 0; **p<.05; ***p<.01 . 

C. Ways of Categorizing CMOs 

CMOs that tend to adopt one practice often adopt other practices that are consistent with their 
overall educational strategy. For example, as noted above, some CMOs that provide intensive 
coaching also employ formative assessments frequently to help teachers determine which skills or 
topics to prioritize. In addition to correlating pairs of practices, we used cluster analysis to explore 
whether CMOs can be categorized based on the mix of practices they employ; examining whether 
CMOs bundle certain practices may provide insight into their strategic approach. Below we discuss 
two ways to categorize CMOs, based on (1) the prescriptiveness of their policies and (2) the core 
practices that define our primary hypotheses regarding the drivers of impacts.'^ 

1. Four Groups of CMOs Based on Extent and Form of CMO Prescriptiveness 

We explored the extent to which CMOs centralize decision-making or delegate authority to 
school principals using several items from the principal survey. Specifically, we used responses to 
survey questions that address whether CMO or school staff typically make key decisions relating to 
three broad areas: (1) curriculum/instructional approach, assessment, and professional development; 
(2) teacher evaluation and compensation; and (3) behavior policy.^® With respect to these three 
broad dimensions of centralization, cluster analysis indicates that CMOs fall into four categories.' 
Group A CMOs tend to be very centralized in decision making only about policies and practices 
related to instmction (that is, educational approach, professional development, and formative 
assessments). Group B CMOs tend to be highly prescriptive or centralized in decision-making 
across all dimensions measured. Group C CMOs are very prescriptive with regard to behavior policy 


We conducted a hierarchical cluster procedure in SAS using Ward’s minimum variance method, in which the 
distance between clusters is defined as the ANOVA sum of squares between two clusters, summed over the variables 
specified. At each step in the procedure, clusters from the previous step are merged so as to minimi 2 e the within-cluster 
sum of squares. Although we were interested in the potential grouping of CMOs on dimensions of prescriptiveness and 
school-level practices, we found no pattern of correlations of our defined core practices with prescriptiveness. 
Therefore, we explored CMO centralization as a separate domain for the purposes of our cluster analysis. 

We created five composite measures of prescriptiveness using principal survey items pertaining to decisions 
made or resources provided by the CMO to its schools: (1) CMO-prescribed educational approach, (2)centralized 
provision of professional development, (3) centralized policy on teacher evaluation and compensation, (4) CMO- 
prescribed formative assessment system, and (5) centralized behavior policy. From these composite measures, we 
developed three vectors of prescriptiveness or centralization using principal components analysis: (1) prescriptiveness on 
instructional policies, (2) centralized policy on teacher evaluation and compensation, and (3) centralized behavior policy. 

One outlying CMO does not appear to fit other patterns of prescriptiveness. 
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and decentralized elsewhere, on average. Group D CMOs appear to be very decentralized in 
decision making across all dimensions, on average. 

These group typologies are depicted in Table III.4. Each row indicates one of the dimensions 
of principal-reported prescriptiveness, while the mean values and corresponding rankings across 
these dimensions are reported by the group in each column. The CMO and district comparison 
means are provided for reference. The column for Group A, for example, illustrates that those 
CMOs tend to be highly prescriptive and rank first in their degree of centralization of teacher 
evaluation criteria and compensation on average, while they are substantially less apt to mandate 
practices in the other areas measured. 


Table III.4. Prescriptiveness Varies Across CMOs and Across Dimensions Within CMOs”. 2 i 


Prescriptiveness Components 

Croup A: 
Tight 

Instruction 

Croup B: 
Tight 

Croup C: 
Tight Evaluation 
and Compensation 

Croup D: 
Loose 

CMO Mean 

Comparison 

Mean 

Instructional vector 
(educational approach, 
professional development, 
and formative assessments) 

3rd (-0.68) 

1st (0.71) 

2nd (0.25) 

4th (-2.07) 

-0.43 

0.42 

Teacher evaluation criteria 
and compensation 

1st (0.42) 

2nd (0.32) 

3rd (-1.07) 

4th (-1.96) 

-0.45 

0.39 

Behavior policy 

2nd (-0.44) 

1st (1.21) 

3rd (-1.12) 

4th (- 1.43) 

-0.31 

0.31 

N 

13 

5 

8 

4 




Source: Principal Survey. 

2. Four Clusters of CMOs Defined by Core Practices 

In addition to examining groupings of CMOs based on the tightness of their management 
approach, we also explored the extent to which CMOs fall into groups defined by their bundling of 
particular practices. Based on implementation of the core measures of school-level practices 
corresponding to six of our primary impact hypotheses, the cluster analysis suggests that CMOs can 
be categorized into four clusters 

/. “Incremental Innovation” Cluster, the largest cluster, is composed of CMOs that 
deviate the least from the district means across the six core practices, on average. This 
large group appears to include two subgroups that, while relatively similar on most 


The values for each of these composite measures are standardized across CMOs and district comparison groups 
such that the mean is 0 and the standard deviation is 1. 

The outlier CMO not included here has the following mean values; -0.98 on the instructional vector, -4.66 on 
teacher evaluation and compensation, and 1.87 on behavior policy. 

The tree diagram depicted the hierarchical clustering procedure indicated that the number of clusters could range 
from three to five. An examination of the means by cluster and the CMO values within each cluster indicated that a 
specification of four clusters was the most meaningful, with a combination of separation between clusters and 
consistency within clusters. 

That is, this cluster includes the fewest dimensions that deviate from the district mean by more than 1 standard 
deviation. 
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dimensions measured, diverge in the extent to which they centralize their educational 
approach. 

2. “Data Driven” Cluster, the next largest cluster, is composed of CMOs that are most 
likely to implement performance-based compensation and evaluation and use formative 
assessment data most frequently. They also emphasize teacher coaching and tend to 
centralize instmction materials and methods. 

3. “Time on Task” Cluster is composed of CMOs that tend to emphasize comprehensive 
behavior policies at the school level and require the most instructional hours for their 
students. In addition to ranking first in those dimensions. Table III. 5 shows that they 
also tend to coach and monitor teachers most frequently. 

4. “Alternative Approach” Cluster is composed of CMOs that tend not to consistendy 
emphasize the key practices measured by the principal survey, ranking last among the 
clusters on these dimensions; these CMOs may focus on other strategies we have not 
captured. 

The specific practices differentiating these clusters are shown in Table III. 5. As in 
Table III.4, each row indicates one of the core practice measures, while the mean values and 
corresponding rankings across these dimensions are reported by the group in each column. 


Table III. 5. Core CMO Practices: Rankings and Means by Cluster 



Cluster 1 
Incremental 
Innovation 

Cluster 2 
Data Driven 

Cluster 3Time 
on Task 

Cluster 4 
Alternative 
Approach 

CMO 

Mean 

Comparison 

Mean 

Consistent behavior 
policy 

2nd (0.35) 

3rd (0.16) 

1st (1.26) 

4th (- 2.02) 

0.30 

-0.30 

Frequent use of 
formative assessments 

3rd (-0.03) 

1st (0.68) 

2nd (0.1 1) 

4th (- 2.85) 

0.03 

-0.03 

Consistent educationai 
approach 

3rd (-0.47) 

1st (0.64) 

4th (-1.36) 

2nd (-0.45) 

-0.31 

0.30 

Frequency of teacher 
coaching/mentoring 

3rd (-0.05) 

2nd (0.91) 

1st (1.14) 

4th (-1.55) 

0.30 

-0.30 

Performance- based 
compensation 

3rd (0.42) 

1st (1.19) 

2nd (0.61) 

4th (-0.32) 

0.60 

-0.62 

Yeariy instructionai 
hours 

3rd (-0.16) 

2nd (0.76) 

1st (2.28) 

4th (-0.45) 

0.41 

-0.42 

Number of CMOs in 
cluster 

19 

9 

5 

2 




Source: Principal Survey. 

Although Table III. 5 shows that patterns in the implementation of key practices help to define 
the clusters of CMOs on average, scatter plots illustrate that individual CMOs within each cluster are 
indeed clearly differentiated on a few particular dimensions. In other words, these scatter plots offer 
evidence that, on the key dimensions, the means used to define the clusters are not masking 
substantially different individual CMO values within those clusters. Figures III. 12, III. 13, and III. 14 
show that clusters of CMOs are most differentiated in their emphasis on performance-based 
compensation and formative assessment use, behavior policy and instmctional time, and teacher 
coaching and monitoring. 
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CMOs within the Data Driven cluster are consistent in their focus on formative assessment 
data and performance-based compensation. Figure III. 12 plots CMO values on our composite 
measure of the use of formative assessment data versus values on our composite measure of the 
emphasis on performance-based compensation and evaluation; values farther to the right on the x- 
axis indicate greater frequency in reviewing formative assessment data, while values higher on the y- 
axis indicate a stronger emphasis on performance-based compensation and evaluation. The points 
shown as red squares depict the consistency of the Data Driven CMOs in emphasizing both 
practices. We can see CMOs from other clusters drifting to the upper right-hand corner of the plot, 
showing that an emphasis on these more data-focused practices is not entirely absent among other 
CMOs. However, the CMOs in the Data Driven cluster are most alike in their bundling of the 
dimensions shown. 


Figure 111.12. Data Driven CMOs Emphasize Frequent Use of Formative Assessment Data and 
Performance- Based Compensation 



Source: Principal Survey. 


CMOs in the Time on Task cluster are all consistendy differentiated in their emphasis on 
comprehensive school- wide behavior policies and longer instruction time. Figure III. 13 plots CMO 
values on our composite measure of a comprehensive school-level behavior policy versus values on 
instructional time. The Time on Task CMOs (identified by the green triangles) are concentrated in 
the upper right-hand quadrant, while CMOs in other clusters tend to fall below and to left. 

As noted earlier, both the Data Driven and Time on Task CMOs emphasize coaching. 
Figure III. 14 plots CMO values on our composite measure of a comprehensive school-level 
behavior pohcy versus values on our composite measure of intensity of teacher coaching and 
monitoring. The red squares and green triangles identifying the Data Driven and Time on Task 
CMOs are arrayed towards the top because of the emphasis CMOs in these clusters place on teacher 
coaching. The Time on Task CMOs are concentrated to the right because of their emphasis on 
behavior policy. 
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Figure 111.13. Time on Task CMOs Maximize Instructional Time and Emphasize Comprehensive 
Behavior Policies 
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Source: Principal Survey. 

Figure 111.14. Both Data Driven and Time on Task CMOs Engage in More Frequent Teacher Coaching 
and Monitoring 
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Source: Principal Survey. 

Although the CMOs within the Data Driven and Time on Task clusters are very consistent in 
their emphasis on the dimensions that typify their clusters, there is substantially more variation 
within our largest cluster — the Incremental Innovation cluster. Our largest group of CMOs is least 
distinguishable on any practices; these CMOs deviate the least from districts on these dimensions, 
on average. That said, the plotting of specific data points for each CMO in the group indicates 
substantial variation within this cluster on one particular practice: the extent to which schools rely 
on CMOs to centralize instruction and curriculum. In Figure III. 1 5, the subgroup designated as 
Incremental Innovation - B consists of CMOs that tend toward decentralization or more school- 
level autonomy in educational approach, in contrast to the CMOs in the Incremental Innovation - A 
subgroup, which take a more centralized CMO-level approach. 
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Figure III.1 5. Variation Within Incremental Innovation Cluster on Educational Approach 


A 



TT? — ■ 

1 




A 

A ' 

A 

V 

C\ C . 


♦ 








♦ 

2 k\ 

.5 

♦ 

1 A-( 

♦05- 

♦ S 


1 

A 



♦ 







1 C . 

♦ 






< 

T . 

• 






3_C_. 


♦ 



5 


Educational Approach 


♦ Incremental Innovation - A A Incremental Innovation - B 


Source: Principal Survey. 

A central question for this study is whether and how CMO practices contribute to greater 
student achievement. Chapter V examines the relationship between CMO practices and impacts on 
student achievement. The mechanism through which practices might affect students could be either 
direct (for example, extending the time allocated for academic instruction) or indirect (for example, 
coaching teachers on ways to collaborate with one another or make use of student assessments). In 
the next section, we examine two intermediate school level outcomes — instructional coherence and 
organizational health — that could mediate the indirect relationships between CMO practices and 
CMO impacts. 

D. Instructional Coherence and Organizational Health of CMO Schools 

If CMOs have a positive effect on student achievement, the effect must be mediated by 
something happening in their schools, which have direct contact with students. What is it about 
affiliation with a CMO that could increase school performance? Our descriptive analysis of CMO 
school practices suggests that at least some CMOs encourage or assist their schools in coaching and 
evaluating teachers, extending instructional time, and fashioning school-wide behavioral strategies. 
In addition, previous research indicates that new charter schools often struggle to unite all the parts 
of their instructional programs, and that principals are sometimes pulled away from instmctional 
leadership by crises involving facilities, finance, compliance, and staffing (HiU and Rainey 2010). 

At the outset of the study, we hypothesized that successful CMOs might address the challenges 
faced by new charter schools in at least two ways. First, CMOs could promote instmctional 
coherence — helping schools ensure that the different aspects of their instructional program 
complement and reinforce one another. Some studies have suggested that instmctional coherence is 
related to student achievement (Newman et al. 2001). Second, by reducing the administrative duties 
of principals and stabilizing the working environment for teachers, CMOs could increase schools’ 
organizational health. 

In order to examine instmctional coherence and organizational health in CMO schools, we first 
developed measures of these outcomes using our survey data. The instmctional coherence measures 
rely entirely on the teacher survey and hence are available only for the CMO schools. The 
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organizational health measures, with one exception, rely on the principal survey, which allows us to 
compare CMO and district schools. Using these survey-based measures, we explored whether 
specific CMO practices are associated with higher levels of either instmctional coherence or 
organizational health in schools. 

Building on work done by the Consortium on Chicago Schools Research (Newman et al. 2001), 
we developed two multi-item measures of instmctional coherence: 

• Use of a common instructional framework with consistent focus, pacing, 
standards, and formative student assessments, based on teacher survey items 
addressing perceptions of the extent to which instruction and curriculum are aligned 
within and across grades, clarity and consistency of learning standards, and modification 
of lesson plans based on formative assessment data 

• A cooperative and supportive environment for teachers, based on perceptions of 
staff cooperation and administrative support as reported in the teacher survey 


We also developed and analyzed five sets of organizational health measures: 

• High teacher job satisfaction, as reported in the teacher survey 

• Administrative staff stability, based on principals’ reports of turnover in the last three 
years and the need for recent leadership on an emergency basis 

• Average number of applications per open teaching position during the past two 
years, as reported in the principal survey 

• Low student turnover, based on school attrition rates as reported by principals 

• Minimal legal, financial, and other administrative challenges, based on principal- 
reported time spent addressing issues related to finances/payroll, facilities leases, and 
building or grounds maintenance 

In this section, we examine which CMO strategies and actions, if any, are correlated with 
instructional coherence and organizational health. We also explore differences between CMO and 
district schools on some of our measures of organizational health subcomponents. 

1. Instructional Coherence appears to be high when teachers are observed frequently 

As established via our case studies, CMOs encourage intensive teacher monitoring and 
coaching, intended to help teachers implement the CMOs’ classroom management strategies and 
focus instruction on skill areas identified as priorities via formative assessment of students. 

A strategy of intensive teacher coaching and monitoring, as reported in our surveys, appears to 
be associated with high levels of instructional coherence. CMO-affiUated schools whose teachers 
report that they are observed frequently score high on both composite measures of instmctional 
coherence, and schools whose teachers report receiving frequent guidance from their principals also 
score high on our measure of a collaborative and supportive environment. These findings are 
confirmed when we use measures of coaching from the principal and CMO central office staff 
surveys. 


43 


III. CMO Practices and School Outcomes 


2 . CMO Principals Appear To Spend Less Time on Administrative Duties Than Do 

District Principals 

One pattern common in charter schools that CMOs seek to address is the need for principals 
to devote so much time to addressing administrative, compliance, and human resource management 
challenges that they cannot be instmctional leaders. CMO central offices negotiate leases and 
manage compliance reporting in order to minimize principal burden. They also provide day to day 
help with hiring, personnel management, and data analysis, which is intended to allow principals to 
focus on classroom monitoring and assistance to teachers. 

Indeed, surveys of CMO and district principals indicate that CMOs may be successful in 
minimizing administrative challenges for their principals. Compared to principals of district-mn 
public schools serving similar students, CMO principals report that they spend less time resolving 
issues related to payroll and building and grounds maintenance (an average of less than once per 
month by CMO principals compared to one to five times per month by district principals, p<.01). 

3. Principal Turnover Is Lower Where CMOs Provide Professional Development and 

Higher Where CMOS Prescribe the Curriculum 

Principal turnover is a major challenge for charter schools and can be highly dismptive to 
instruction, staff morale, and parent confidence. Some CMOs have sought ways to make the school 
leadership positions more manageable in order to minimize principal turnover. In addition, some 
CMOs have tried to build leadership pipelines and identify possible successors in an effort to 
preemptively reduce the impact of principal turnover. 

CMO actions are linked to principal turnover, in both predictable and puzzling ways. CMOs 
that support their schools by providing access to professional development, including workshops or 
in-service training programs, have lower principal turnover compared to other CMOs, on average. 

However, CMOs that are more prescriptive about the educational approach a school must take, 
as reported by principals, also experience more principal turnover, on average. This finding is one 
of a set of surprising results about the links between CMO actions and organizational health. 
Organizational health is negatively correlated with teacher coaching and professional development 
support provided by the CMO, frequency of CMO staff visits to the school, and the use of sample 
lessons in teacher hiring. These results may reflect a reversal of cause and effect: CMOs may 
undertake these actions when they believe a school is in trouble. Alternatively, the association could 
be driven by a third unobserved factor. 
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IV. CMO SCHOOLS’ IMPACTS ON STUDENTS 


Key Findings 

• Impacts on student achievement vary a great deal across CMOs, 
ranging from substantially positive to substantially negative. 

• Impacts of individual CMOs are more often positive than negative, 
and in some instances are large. 

• Overall average impacts appear to be positive, but they are not 
statistically significant. 

• CMOs that exhibit positive test score impacts in one subject tend to 
also exhibit positive impacts in other subjects including math, 
reading, science and social studies. 

• Several CMOs appear to have larger math and reading impacts for 
Hispanics than for other students, but impacts do not seem to vary 
appreciably for subgroups defined by gender, baseline test scores, or 
free and reduced-price lunch eligibility. 


A. Introduction 

As noted in the introductory chapter, an extensive body of research suggests that variation in 
the performance of charter schools is wide but that high-performing charter schools can produce 
substantial positive achievement effects for their students. CMOs represent an attempt to produce 
the effects of high-performing charter schools on a larger scale. To date, little research has examined 
their success in doing so. 

In this chapter, we report our estimates of the effects of CMOs on test scores, examining not 
only average effects across all CMOs but also the variation in effects among CMOs. The CMO 
(rather than the school) is the key unit of analysis for the study, and variation of impacts among 
CMOs is of particular interest given the variation in CMO practices reported in the survey results 
described in the preceding chapter. Although much of the existing research on charter schools 
focuses on math and reading impacts, there is also interest in the test score impacts for other 
subjects that typically have not been the focus of high-stakes state accountability systems. We 
therefore estimate impacts on science and social studies tests (where available) as well as reading and 
math tests, and we examine the extent to which the impacts of CMOs are correlated across subjects. 
The rapid growth and substantial philanthropic investments in CMOs discussed in Chapter II beg 
the question: do successful CMOs attract more funding, which then leads to organizational 
growth? We test this hypothesis by looking at the associations between impacts and measures of 
CMO size and growth. A related question is whether CMOs are able to expand without 
compromising the effectiveness of both their existing and new schools. To answer this question, we 
conduct a series of within-CMO analyses to gauge whether individual CMOs’ impacts tend to get 
larger (or smaller) as they grow. Following up on prior studies of charter schools (Gleason et al. 
2010; Angrist et al. 2011) that have found suggestive evidence of greater benefits for low-income 
minority students in urban areas, we examine whether CMOs benefit certain subgroups of students 
differentially. 
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More specifically, this chapter addresses the following research questions: 

1. How much do CMOs vary in their effects on student test scores in math, reading, 
science, and social studies? 

2. What are the average effects of CMOs on student test scores in math, reading, science 
and social studies? 

3. Are CMO test score impacts correlated across subjects within CMOs? 

4. Do successful CMOs grow more than less successful CMOs? 

5. Are CMOs able to sustain their impacts as they grow? 

6. Are particular subgroups of CMO students differentially impacted by CMOs? 

As we describe in detail in the methods section below, our ability to estimate CMO 
achievement effects depends on having “pretreatment” test scores for CMO students and 
comparison students. This means, unfortunately, that we cannot estimate the achievement effects of 
CMO elementary schools. No data are available on the math and reading achievement of five-year- 
olds before they enter kindergarten. In most states, standardized testing does not begin until the end 
of third grade, a full four years after the beginning of elementary school. As a result, there are no 
proven and generaUzable methods for estimating impacts of charter elementary schools. (Indeed, we 
ares not aware of a good nonexperimental method for estimating impacts of anj elementary 
schools — a problem that has not been sufficiendy recognized by researchers or poUcymakers.) 

High school impact analyses can, in contrast, make use of pre-entry achievement data, but they 
pose other challenges related to a limited number of standardized tests, high rates of grade 
repetition, and (in many places) tests that are course specific rather than grade specific. In a follow- 
up report, we will examine the effects of CMO high schools, focusing on the long-term attainment 
measures of high school graduation and college entry. For the current report, however, the impact 
analyses focus on the achievement effects of CMO middle schools. 

We begin the chapter by describing our analysis sample, specifying the CMOs and the number 
of students included in our analyses and defining the outcomes of interest. We then describe our 
approach to estimating impacts. We also discuss several threats to the validity of our impact 
estimates and how we address them. Finally, we present impact findings in math, reading, science, 
and social studies for middle school grades in 22 CMOs across the country. 

B. Data and Scope of CMOs and Students Included 

Among 26 CMOs across the country that have middle schools and met the eligibility criteria for 
the study (described in Chapter 1), the study obtained sufficient state and district school records data 
to estimate impacts for 22 (85 percent). The impact estimates cover 68 CMO middle schools (81 
percent of all of the eligible middle schools). The analyzed CMO schools span a total of eight states 
(including 16 metropolitan areas and two rural districts) located in the West, Southwest, Midwest, 
and Mid-Atlantic regions. 

The impact analyses use school records data obtained from states and districts. The outcomes 
are grade- specific statewide assessments in math, reading, science, and social studies. All pre- 
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baseline, baseline, and outcome test scores were converted to standard deviation units, also known 
as z-scores.^ AU districts provided data on test scores (in at least reading and math and sometimes in 
science and/or social studies), race, gender, and school enrollment. In addition, a majority of 
districts also provided data on English language learner (ELL) status, special education status, and 
free and reduced-price lunch status. 

To be included in the analysis sample, CMO students had to enter the relevant CMO school in 
the school’s normal intake grade,^ have a baseline test score in at least reading or math, and have a 
follow-up test score for the relevant achievement test. The total number of CMOs, CMO students, 
and matched comparison students associated with each outcome is shown in Table IV.l.'^ 


Table IV.l. Achievement Analysis Sample Sizes for CMOs 


Outcome 

Measure 

After 1 year of 
treatment 

After 2 years 
of treatment 

After 3 years 
of treatment 

Math 

Number of CMOs 
Number of CMO students 
Number of comparison students 

22 

18,606 

321,296 

22 

13,434 

237,490 

14 

5,747 

121,050 


Number of CMOs 

22 

22 

20 

Reading 

Number of CMO students 

18,769 

13,674 

8,131 


Number of comparison students 

325,063 

242,946 

1 59,945 


Number of CMOs 

N. A. 

N. A. 

1 1 

Science 

Number of CMO students 

N. A. 

N. A. 

3,803 


Number of comparison students 

N. A. 

N. A. 

72,121 


Number of CMOs 

N. A. 

N. A. 

9 

Social 

Studies 

Number of CMO students 

N. A. 

N. A. 

3,529 


Number of comparison students 

N. A. 

N. A. 

69,751 


Source; State, district, and CMO school records. 


We focus most of our attention on the cumulative impacts for two years after CMO entry. 
These have the advantage of including more than just one year of treatment while still including all 
22 CMOs. Science and social studies estimates, however, must be based on impacts after three years 


' The 2 -score was calculated as the student’s scaled score minus the mean scaled score for aU students In a given 
population taking the test in the same year and grade and then divided by the standard deviation of scores for that same 
group. For 11 of the 22 CMOs, 2 -scores were calculated using the statewide mean and standard deviation of scores for 
each test. In cases where the statewide mean and standard deviation could not be derived from the data or obtained 
through published sources, 2 -scores were calculated using the distribution of scores In the district-level data files 
provided to the study. 

^ The analytic sample did not Include CMO students who enrolled for the first time after the normal Intake grade 
In a CMO school. However, for the sample of analy 2 ed schools at all 22 CMOs, students who enrolled In the normal 
Intake grade represent the majority of total student enrollment during middle school. 

3 Table IV.l shows the number of unique CMO and matched comparison students analy 2 ed for a given outcome. 
To account for data attrition, whenever a CMO student was missing data for a given outcome, all of the matched 
comparison observations for that student were also dropped from the sample for that outcome. 
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because most states do not test those subjects in earlier grades and so we cannot examine impacts 
after two years/ The science and social studies impacts are based on smaller numbers of CMOs. 

C Methods for Estimating Impacts of CMOs 

Producing rigorous and broadly applicable measures of the effects of CMOs (or of charter 
schools more generally) is methodologically challenging. The challenge is inherent in the 
intervention: Charter schools are by definition schools of choice, which means their students may 
differ from students in conventional public schools in ways that are not readily apparent (for 
example, in student or parent motivation). Failure to account for these differences could lead 
researchers astray and thus produce biased estimates of CMO impacts. 

The best way to ensure the validity of impact estimates is to conduct a randomized experiment. 
Properly designed and implemented randomized experiments produce impact estimates that support 
stronger causal conclusions than any other method does by ensuring that the treatment and control 
groups are similar on all observed and unobserved characteristics prior to receiving an intervention. 
Thus, any significant difference between group outcomes can be attributed to the impact of the 
intervention. In the charter school context, we can sometimes perform randomized experiments 
using the lotteries that oversubscribed schools conduct to determine who will be admitted (see, e.g., 
Angrist et al., 2011; Dobbie and Fryer, 2011; Gleason et al., 2010; Hoxby and Murarka, 2009). 

However, not all charter schools are oversubscribed, not all oversubscribed schools use 
lotteries, and not all schools using lotteries keep good records of winners and losers. In conducting 
several charter-school studies (see, for example, Gleason et al. 2010; Tuttle et al. 2011), Mathematica 
researchers have found that admissions lotteries can be used for experimental analysis in only a small 
proportion of charter schools nationwide. Of the 292 CMO schools initially targeted for inclusion in 
our study, data adequate for an experimental analysis were available in only 1 6 schools and only for 
select grades and cohorts of those schools. Moreover, the oversubscribed schools where 
experimental analysis is possible might be different from the schools where it is not possible (Tuttle 
et al. 2010; Abdulkadiroglu et al. 2009). Oversubscribed charter schools might be oversubscribed 
because they are more effective than non-oversubscribed schools. In sum, admissions lotteries can 
be used to conduct experiments producing strong causal inferences about a small subset of CMO 
schools, but they cannot be used to examine the impacts of large numbers of CMO schools across 
the country. 

Two primary aims of this study are to determine the average effectiveness of CMOs and to 
assess and understand the diversity of impacts among CMOs. To address these questions it is critical 
to estimate the effects of as many CMOs and CMO schools as possible. We therefore require a 
method that, unlike experimental or lottery-based methods, can be used for large numbers of 
schools. We use a nonexperimental panel (NXP) approach, which involves first identifying a 
matched comparison group of non-CMO students who are similar to the CMO students 
immediately prior to CMO enrollment and then following the achievement trajectories of individual 
students in both groups for several years. NXP methods for estimating CMO impacts rely on 
longitudinally linked data on individual students before and after they enter CMO schools. Our 
preferred NXP approach, propensity score matching (described below), involves comparing the 


'' Most states in our sample administer grade-specific science and/ or social studies tests in grade 8, although there 
are some states that also administer these tests in other middle and high school grades. 
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achievement of CMO students to non-CMO students who have been identified based on the 
similarity of their baseline achievement (prior to entering the CMO school) and other characteristics. 
Although NXP methods lack the strong causal validity of randomized experiments because matched 
students may differ on unobserved characteristics, they allow us to include many more schools and 
CMOS. 

Test scores measured prior to CMO entry are critical to our NXP approach. Prior research in 
various topical areas has demonstrated that NXP methods can replicate the findings of randomized 
experimental studies if the researchers have a good pretreatment measure of the outcome of interest 
(Glazerman et al. 2003; Cook et al. 2008). In addition, our study provided an opportunity to direcdy 
test the NXP methods against experimental results in the subset of CMO schools for which 
admissions lottery data are available and usable. The success of this test — described later in this 
chapter and in more detail in Appendix B — confirms that in the CMO context, nonexperimental 
methods can reproduce rigorous experimental findings if they can make use of pretreatment 
measures of relevant outcomes along with other student-level covariates that are widely available in 
administrative data. 

1. Propensity Score Matching Identifies a Nonexperimental Comparison Group 

To obtain a matched comparison group, we use a propensity score matching (PSM) procedure. 
The central concept of the PSM approach is to estimate the probability of being in the treatment 
group using observed data for treatment and potential comparison group students (Rosenbaum and 
Rubin 1983). This probability is known as the propensity score — here, the likelihood of enrolling in 
a CMO middle school. Theoretically, if the propensity score succeeds in removing aU unmeasured 
differences between the two groups that are related to test scores during the follow-up period, 
matching on that propensity score would result in an unbiased estimate of the impact of treatment. 
We use various student characteristics available in school records — most prominently, a student’s 
prior test scores — to estimate each student’s propensity to enter a CMO middle school. For details 
on the propensity model selection, see Appendix C. 

After estimating the propensity scores, the next step is to select a matched comparison group of 
students whose estimated propensity scores are similar to those of treatment group students. To 
improve statistical precision, we selected multiple matches for each treatment student, increasing the 
total sample size. To ensure the quality of the matches and reduce bias, we matched with 
replacement (allowing each comparison student to match to more than one CMO student) and 
implemented caliper matching, whereby a given treatment student is matched to aU comparison 
students with estimated propensity scores within a specified range (caliper) rather than merely a 
specified number of nearest neighbors. The matching procedure is implemented separately for each 
grade, cohort, and site combination. For each CMO, we were able to match between 64 percent and 
100 percent of CMO enroUees in the primary intake grade who had at least one valid outcome, with 
a match rate of at least 90 percent for most of the test scores that we rely on as key outcomes.^ We 
were able to obtain matched comparison groups of students who were equivalent in terms of 


5 Match rates were 90 percent or better for 17 of the 22 CMOs on two-year reading and math outcomes, for 8 of 
the 11 CMOs on the three -year science outcome, and for 6 of the 9 CMOs on the three -year social studies outcome. 
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baseline test scores, gender, race, and free or reduced-price lunch status to students in the treatment 
groups. (For details on the matching procedure see Appendix C.) 

2. Statistical Regression Controls for Remaining Differences in Estimating Impacts 

Following the creation of matched samples, we employ a regression model when estimating 
impacts to improve statistical precision and to control for any remaining differences in baseline 
characteristics. Our CMO-specific impact regression models include pre-baseline (i.e., two grades 
prior to CMO entry) and baseline test scores in reading and math; corresponding missing test score 
indicators; and other student characteristics, including race/ ethnicity, poverty status (as measured by 
eligibility for free or reduced-price lunches), disability status, and ELL status.^ 

The impact for the average CMO is calculated by averaging the CMO-specific impacts for a 
given outcome, and by treating the CMOs in our analysis sample as representing a broader 
population of CMOs. We also examine whether CMO-specific impacts vary by student’s sex, 
race/ethnicity, eligibility for free or reduced-price lunch, prior achievement, and the number of 
schools that CMO operates in a given year by including an interaction between the subgroup 
indicator and treatment group status in the CMO-specific impact model. 

3. We Account for Selective Attrition and Grade Repetition 

Even if PSM successfully identifies comparison students who are similar to CMO students in 
relevant respects prior to CMO entry, estimates of impact in the long term could be biased if 
students who are struggling academically are more likely to exit the CMO schools. We address this 
possibility by keeping all CMO students permanently assigned to the treatment group even if they 
exit the CMO after their first year (or later). This is analogous to an “intent-to-treat” analysis in the 
experimental context.* In essence, we avoid the possibility of selective attrition by preventing 
transferring students from leaving the treatment condition. This means that our impact estimates 
will be conservative, in the sense that they underestimate the full effect of CMO enrollment on the 
students who remain enrolled. 


^ The sample of analy2ed CMOs and CMO students varies by test subject and outcome year for two reasons. First, 
some cohorts are observed for only one or two follow-up years. As a result, the sample for three -year impacts includes 
fewer student cohorts than do the samples for one-year or two-year impacts. Second, individual students may have 
missing outcome data for other reasons. On average, approximately five percent of the sample observed in the first 
follow-up year is missing data in the second year, and eight percent is missing data in the third year. When a CMO 
student disappears from the data, the matched comparison students are also dropped (and vice versa). As a result, the 
study’s CMO and non-CMO samples maintain baseline equivalence in their pre-entry test scores for aU outcome years 
and test subjects, and most maintain equivalence on race, gender, and free and reduced-price lunch status as well (see 
Appendix D for a more detailed description of baseline equivalence results). 

^ Not all student variables are available in all jurisdictions. We use weights that adjust for the distribution of 
treatment and comparison students across matching strata that are defined by grade, cohort, and district so that groups 
of students belonging to each unique grade, cohort, and district combination contribute equally to our impact estimates. 
We use robust standard errors that adjust for clustering of students within schools. 

® The analogy is imperfect because some students may exit the CMO schools in their first year of enrollment, prior 
to testing and before we capture them in the treatment group. In general, we cannot identify transfers that occur prior to 
testing. 
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Student grade repetition also creates analytical challenges: A student who repeats a grade takes 
different state end-of-year tests than other students in her/his original cohort take. Most state 
assessments are not vertically scaled, which prevents us from using observed test scores in a given 
academic year to compare the achievement of retained students to students in the same cohort who 
were promoted. If a CMO has a policy of retaining low-performing students at a higher rate than the 
other public schools, ignoring the attrition of students due to grade retention may result in biased 
CMO impacts. Following Tuttle et al. (2011), we address this issue by assuming that the retained 
students will perform at the same level relative to other students in their cohort as in the year prior 
to being retained. If a CMO has a higher rate of grade repetition and positive impacts (and if the 
impacts are positive even for the grade repeaters), this assumption will tend to underestimate true 
impacts. For more information on grade retention, see Appendix E. 

4. The Matching Method Successfully Replicates Rigorous Experimental Impact 

Estimates 

As mentioned earher, we validated our PSM approach against benchmark experimental 
estimates in the same CMOs. This validation effort, conducted for a subset of CMO schools^ in 
which experimental analyses could also be conducted, finds that PSM produces impact estimates 
that are very similar to the benchmark experimental impact estimates. In both reading and math, 
PSM estimates are statistically indistinguishable from experimental estimates, and the point estimates 
differ by very small amounts (0.01 to 0.03 standard deviation units). Site-specific estimates across 
seven lottery sites indicate that PSM results correlate with experimental results at levels of 0.9 or 
higher. These results provide evidence supporting the rigor of nationwide NXP/PSM impact 
estimates for the larger set of CMO schools where experimental methods are not feasible. (See 
Appendix B for more detail.) 

D. CMOs’ Impacts on Middle School Test Scores 

1. After Two Years of Enrollment, CMO Impacts on Students’ Reading and Math 

Achievement Are More Often Positive than Negative 

The extent of variation in CMO impacts is potentially important for two reasons. First, 
policymakers and funders are interested in the extent to which CMOs show uniformly positive 
impacts because this could inform whether and how they seek to encourage the development of the 
most successful CMOs. Second, variation in impacts provides a basis for exploring factors that 
might contribute to this variation, a topic examined in the next chapter. 

The test score impacts discussed in this chapter are cumulative impacts. In other words, two- 
year CMO impacts, for example, refer to the total CMO impact on students who entered the CMO 
school two years earlier and not to the incremental impact of the second year of CMO enrollment. 
We focus most of our attention on the two-year impacts, because two years is the longest period we 
can examine with the full sample of 22 CMOs. 

In both reading and math, after two years of enrollment in CMOs, positive impacts are more 
common than negative impacts. Of the CMOs covered by the impact analysis, half (11 of the 22) 


® Refer to Appendix B Section III.B for more details on how the diversity of CMOs in the validation effort 
compares with the diversity of CMOs included in the impact analyses. 


51 


IV. CMO Schools’ Impacts on Students 


have positive impacts in math or reading while nine have negative impacts in one or both subjects 
(see Table IV.2). Ten of the CMOs have positive impacts in both subjects while only 4 have 
negative impacts in both subjects. 


Table IV.2. Number of CMOs With Positive and Negative Impacts in Math and Reading 



Significant Positive 

Math Impacts 
Insignificant 

Significant Negative 

Reading Impacts 
Significant Positive 

10 

0 

0 

Insignificant 

1 

2 

3 

Significant Negative 

0 

2 

4 


Source: State, district, and CMO school records 


2. The Range of Magnitude of Impacts for Individual CMOs Is Wide, Especially in Math, 

for Which the Positive Impacts of the Highest Performing CMOs Are Large 

In addition to counting the number of CMOs with positive or negative impacts, it is also 
important to examine the size of those impacts. In specifying the size of impacts, we first use the 
typical approach of calibrating them in terms of the overall standard deviation in test scores in the 
entire state or district. Figure IV. 1 below shows the distribution of estimated two-year math impacts, 
and Figure IV.2 shows the corresponding distribution for reading. Each of the 22 bars in the figure 
represents a single CMO impact, ordered left to right, from the most negative to the most positive. 
Statistically significant (insignificant) impacts are illustrated using darker (lighter) shades of red 
(negative impacts) and blue (positive impacts). One-year and three-year impacts for math and 
reading can be found in Appendix G. 

For the lowest performing CMOs, two-year” impacts are between -0.2 to -0.3 of a standard 
deviation in both math and reading. But the largest positive (and statistically significant) impacts in 
math exceed 0.6 of a standard deviation, twice the size of the negative math impacts of the lowest 
performing CMOs. In addition, these positive impacts for math are more than twice as large as the 
largest positive impacts in reading. Larger impacts in math than in reading were also observed in 
recent studies of charter schools such as the KIPP Lynn study (Angrist et al. 2010), the KIPP middle 
school study (Tuttle et al. 2010), the Harlem Children’s Zone Promise Academy study (Dobbie and 
Fryer 2011), the Boston charter schools study (Abdulkadiroglu et al. 2009), and the New York City 
charter schools study (Hoxby et al. 2009). 


We adjusted for multiple comparisons using the Benjamini-Hochberg correction and found that all impacts that 
were statistically significant remained significant post-adjustment (see Appendix F for details). To test if these impacts 
are homogenous, we performed a Q-test and were able to reject the null of homogeneity (Q-statistic is 1043.4 and 457.3 
for math and reading, respectively). 

** The CMO-level impacts in both subjects are highly correlated across years (0.89 between one-year and two-year 
math impacts; 0.94 between two-year and three-year math impacts). 
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Figure IV.l . Distribution of Test Score Effect Sizes After Two Years in Math 
0.80 
0.60 


OJ 

N 

In 

•M 

U 


0.20 

0.00 

0.20 

0.40 

0.60 

0.80 



III 

1 

1 1 


-Illl 

It 

II 

III 



I ■ ■ ■ ■ 


Negative, Insignificant 
Positive, Insignificant 


Negative, Significant 
Positive, Significant 


Source: State, district, and CMO school records. 

Note: “Significant” is defined as statistically significant at the .05 level, two- tailed test. 


Figure IV.2. Distribution of Test Score Effect Sizes After Two Years in Reading 
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Note: “Significant” is defined as statistically significant at the .05 level, two- tailed test. 
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3. The Differences Between High- and Low-Performing CMOs Are Large Enough to 

Produce Substantial Differences in Student Outcomes 

To get a better sense of the variation and magnitude of the CMO impacts, one can compare 
them to several other policy-relevant benchmarks. Data from the National Assessment of 
Educational Progress (reported in Bloom et al. 2008) show black-white and Hispanic-white 
achievement gaps ranging from 0.76 to 1.04 standard deviations in math and reading in grades 4 
and 8. These numbers suggest that the CMOs at the high end of the scale have the potential to 
measurably reduce achievement gaps, especially in math. Another relevant benchmark is the typical 
effect of a year of schooling. A few of the CMOs are producing impacts that appear to be sufficient 
to generate three years of learning gains within two years (Bloom et al. 2008).^^ Those CMOs could 
be seen as producing three years worth of learning over just two years in the classroom. By the same 
token, the lowest performing CMOs are producing negative achievement effects that are nearly as 
large as the effect of a year of schooling — that is, their students have achieved not much more than 
one year of learning after two years in the classroom. 

All of the impact estimates are, of necessity, measured against the performance of other local 
public schools. Our ability to make useful comparisons among CMOs is therefore constrained 
because each of them has a different counterfactual: Some of the variation in impacts among CMOs 
might be related to variation in the quality of the comparison schools rather than in the CMOs’ own 
performance. There is no way to direcdy test this possibility because we can’t relocate CMOs to 
different communities. Nonetheless, we can examine the extent to which CMO impacts appear to be 
driven primarily by the achievement trajectories of CMO students or by the trajectories of 
comparison students. It is theoretically possible that CMO students are experiencing achievement 
trajectories that are consistent across CMOs and that the variance in impacts is attributable to large 
variations in the trajectories of comparison students. If so, this would complicate our interpretation 
of the large variation in CMO impacts. 

In fact, we find that most of the variation in CMO impacts is attributable to variation in CMO 
students’ own achievement trajectories (adjusted via regression for their characteristics). For two- 
year math impacts, for example, the net achievement gains of CMO students range from -0.26 to 
0.55, whereas the net achievement gains of the matched comparison groups (for each CMO) fall in a 
narrower range around zero, from -0.22 at the low end to 0.10 at the high end. Consequendy, the 
esdmated impact of each CMO is correlated very highly (at .95) with the adjusted average 
achievement gain of its students. Meanwhile, the correlation of CMO impacts with the adjusted 
achievement gains of comparison students is much closer to zero, at -0.21. The sign of the average 
achievement gain of CMO students differs from the sign of the impact estimate in only one case out 
of 22 CMOs (for two-year math impacts). In short, the wide variation in impacts among CMOs 


Note that the effect sizes in Bloom et al. (2008) are based on a national population, whereas the effect sizes in 
this study are based on a single state or district. Because test score standard deviations are likely to be bigger nationwide 
than at the state or district level, our impact sizes may be slightly inflated as compared to those in Bloom et al. (2008). 
Bloom’s estimates indicate that the impacts required to achieve an extra year of learning within two years are about 0.2 
to 0.3 in reading and about 0.3 to 0.4 in math at middle school grades. 

'3 Jurisdiction-wide changes in achievement are zero by definition because we have normed test scores within grade 
and subject in each state or district. But the matched comparison group could have trajectories that are positive, 
negative, or zero. 

The correlation of CMO impacts with the achievement levels of comparison students is likewise low. 
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reflects a similarly wide variation in the achievement trajectories of their students rather than being 
primarily driven by local context. 

4. Estimated CMO Effects Are Broadly Consistent with Effects Measured for Other 

Charter Schools 

In a few jurisdictions, we were able to conduct parallel analyses of the effects of independent 
charter schools (that is, charter schools not affiliated with a CMO) alongside our CMO analyses.'^ 
We found no patterns in the relative impacts of independent charter schools versus CMOs across 
the jurisdictions. Results varied across CMOs and across jurisdictions: Some CMOs outperformed 
independent charters; others did not. In general, the magnitude of impacts estimated for 
independent charters was in the same range as the impacts we estimated for CMOs. Details are 
available in Appendix H. 

The most rigorous recent studies of charter schools have reported a range of impacts, 
depending upon the study. If one combines these studies, the span of impacts reported is broadly 
consistent with the range of our CMO impact estimates: 

• In a national sample of oversubscribed charter middle schools, Gleason et al (2010) 
found school-level variation in impacts ranging from -0.43 to 0.33 in reading and from - 
0.78 to 0.65 in math after two years. 

• A study of New York City charter schools estimated annual impacts which, if 
accumulated over two years, would imply effect sizes of 0.12 in reading and 0.18 in math 
(Hoxby et al. 2009). 

• In oversubscribed charter middle schools in Massachusetts, Angrist et al. (2011) found 
annual impacts which, if accumulated over two years, would amount to 0.13 in reading 
and 0.42 in math. 

Similarly, a study of 22 KIPP middle schools (Tuttle et al. 2010) examined the variability in 
impacts among schools. KIPP is not itself a CMO, because it does not have direct operational 
authority over schools; some of KIPP’s regions, however, are CMOs. The schools examined in the 
KIPP study would be in the upper half of the distribution of CMO impacts found here, but the 
magnitudes are broadly comparable. Two-year impacts for the 22 KIPP schools in reading ranged 
from -.12 to .43, whereas two-year impacts in math ranged from zero to 0.75. The KIPP schools 
would therefore be in the top half of the distribution of CMO impacts if we were to plot their 
results next to the CMO-speciflc results. 

5. CMOs Also Show Substantial Variation in Impacts on Science and Social Studies Tests 

States typically do not require science and social studies assessments in every grade. The largest 
number of CMO impacts in science and social studies could be examined only after three years since 
enrollment and only for a subset of our CMO sample — 11 CMOs for science and 9 CMOs for social 


*5 These comparisons relied on ordinary least squares (OLS) regression analyses rather than propensity match 
analyses because OLS analyses also successfully replicated experimental estimates in our validation exercise and because 
they can be conducted with far less labor and computing time. 
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studies. Nonetheless this is a large enough sample to provide useful information about the range of 
impacts generated by many CMOs. 

As in math and reading, there is substantial variation in CMO impacts on science and social 
studies achievement three years after enrollment. Figures IV.3 and IV.4 below show the distribution 
of estimated three-year test score impacts for science and social studies, respectively. For science, the 
number of CMOs with significant positive impacts is equal to the number of CMOs with significant 
negative impacts, and effect sizes range between -0.49 and 0.61. Estimated impacts for three-year 
social studies are positive and statistically significant for five out of the nine CMOs where we were 
able to estimate social studies impacts; only one CMO had a significantly negative impact on a social 
studies assessment. Three-year social studies impacts range between -0.48 and 0.41. 

6. Although Overall Average Two and Three-Year Test Score Impacts Are Positive in All 

Four Subjects, They Are Not Statistically Significant 

We also estimated the impact of the “average” CMO on achievement in each of the four 
subjects. Average impacts provide a sense of the overall contribution of CMOs to efforts to improve 
student achievement. With only 22 CMOs in the sample (and fewer for science and social studies 
impacts), however, average effects would need to be substantial in order to achieve statistical 
significance. ^ 

Estimated test-score impacts for the average CMO are presented in Table 1V.3 below. Although 
average two-year impacts are positive, they are not statistically significant at the five percent level. 
The average CMO’s two-year math impact is 0.11 and is marginally significant (p=0.08). Average 
impacts across CMOs for three-year science and social studies are positive and not statistically 
significant. 


Figure IV.3. Distribution of Test Score Effect Sizes After Three Years in Science 



Source: State, district, and CMO school records. 

Note: “Significant” is defined as statistically significant at the .05 level, two- tailed test. 


If we were to instead estimate the effect of CMOs on the average student, the math estimates would be 
significantly positive. 
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Figure IV.4. Distribution of Test Score Effect Sizes After Three Years in Social Studies 



Source: State, district, and CMO school records. 

Note: “Significant” is defined as statistically significant at the .05 level, two- tailed test. 


Table IV.3. Average CMO Test Score Impacts, by Year After CMO Enrollment 



1 - Year Impact 

2- Year Impact 

3- Year Impact 

Math 

0.06 

0.1 r 

0.1 5 


(0.05) 

(0.06) 

(0.09) 

Number of CMOs 

22 

22 

14 

Reading 

-0.01 

0.03 

0.05 


(0.02) 

(0.03) 

(0.04) 

Number of CMOs 

22 

22 

20 

Science 

N.A. 

N.A. 

0.06 

(0.09) 

Number of CMOs 



1 1 

Social Studies 

N.A. 

N.A. 

0.09 

(0.09) 

Number of CMOs 



9 


Source: State, district, and CMO school records. 

^Significantly different from zero at the .10 level, two- tailed test. 


7. Impacts Are Highly Positively Coffelated Within CMOs Among Academic Subjects 

Another question of interest is whether certain CMOs produce positive impacts in only certain 
subjects or whether CMOs tend to produce impacts of similar direction and magnitude across all 
subjects. Do CMOs appear to focus their efforts on increasing achievement in some subjects — such 
as math and reading, which often receive more attention from policymakers — at the expense of 
others? 
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We find that CMOs that produce positive impacts in one subject also tend to produce positive 
impacts in other subjects. The correlations are as high as 0.86 for (two-year) math and reading 
impacts; the lowest correlation is between (two-year) math impacts and (three-year) social studies 
impacts at 0.60. This suggests that CMOs either do not place more emphasis on particular subjects, 
or if they do, that there are positive spillover effects from skills students acquire for one subject to 
another. Table IV.3 contains the within-CMO correlations of impacts across the four subjects we 
examined for the years with the best data coverage. 


Table IV.3. Within-CMO Correlations of Impacts Across Math, Reading, Science, and Social Studies 



2- Year Math 

2- Year Reading 

3- Year Science 

3- Year Social 
Studies 

2- Year Math 

1.00 





(N=2 2) 




2- Year Reading 

0.86 

1.00 




(N=2 2) 

(N=2 2) 



3- Year Science 

0.81 

0.82 

1.00 



(N=l 1) 

(N=l 1) 

(N=l 1) 


3- Year Social 

0.60 

0.73 

0.79 

1.00 

Studies 

(N=9) 

(N=9) 

(N=9) 

(N=9) 


Source: State, district, and CMO school records. 

8. Among the CMOs in Our Study, Large CMOs Are More Likely Than Small CMOs to 

Have Positive Impacts 

We looked at whether there is a relationship between CMO size, as measured by the number of 
schools operated by the CMO in fall 2009, and two-year math and reading impacts. Figure IV. 5 
shows the distribution of estimated two-year math and reading impacts respectively on the x and y- 
axes, while the size of the bubbles represents CMO size. 

Large CMOs in our sample tend to have positive impacts, while small CMOs are more likely to 
have negative impacts. This might indicate that funders have had some success in supporting the 
expansion of CMOs that are more effective. In particular, eight of the 12 large CMOs (those 
operating more than 8 schools in 2009-10) have significant positive impacts in at least one subject, 
while only 3 of the 10 small CMOs (those operating 8 or fewer schools in 2009-10) have significant 
positive impacts in at least one subject. Meanwhile, only 2 of 12 large CMOs have significantly 
negative impacts in at least one subject, while 7 of 10 small CMOs have significantly negative 
impacts in at least one subject.’^ CMOs that have positive impacts in both reading and math operate 
an average of 12 schools, while those with negative impacts in both subjects operate an average of 6 
schools. Despite this pattern, effectiveness is not related to size in a Unear way: Correlations between 
math and reading CMO impacts and CMO size are not statistically significant. 


Among the 12 large CMOs, 7 have significant positive impacts in both subjects, 1 has significant negative 
impacts in both subjects, and 4 have a mix of significant and insignificant impacts (both positive and negative) in both 
subjects. As for the 10 small CMOs, 3 have significant positive impacts in both subjects, 3 have significant negative 
impacts in both subjects, and 4 have a significant negative impact in 1 subject and an insignificant impact in the other 
subject. 
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Figure IV.5. Comparing Test Score Effect Sizes After Two Years in Math and Reading and CMO Size 
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Source: State, district, and CMO school records. 

We also looked at whether absolute CMO growth (change in the number of schools operated 
by the CMO between fall 2004 and fall 2009) and relative CMO growth (the number of schools 
operated by the CMO in fall 2009 divided by the number of schools operated by the CMO in fall 
2004) are associated with two-year impacts in math and reading. In both of these cross-sectional 
analyses, we found no statistically significant associations. 

9. In Many CMOs, Reading Impacts Decline as the CMO Adds More Schools; Math 

Impacts Do Not Consistently Decline with Growth 

In order to explore the question of whether CMOs are able to expand and maintain their 
effectiveness on student test scores, we examined within-CMO changes over time in size and 
impacts. Specifically we gauged whether, as individual CMOs grow, their impacts tend to get larger 
(or smaller). To be eligible for this analysis, we require that CMOs have at least three different 
numbers of schools during the years covered by the analysis data. This eligibility criterion excluded 7 
out of the 22 CMOs in our sample. 

Based on the results of these longitudinal analyses, we conclude that some CMOs show 
declining impacts as they grow while others do not. Our within-CMO analyses suggest that CMO 
expansion often diminishes student impacts in reading: we found smaller reading impacts due to an 
additional school for nine out of twelve CMOs with statistically significant differential impacts. 
However, there is no clear pattern of changes in math impacts as CMOs grow. Figures IV.6 and 
IV.7 show the difference in magnitudes of two-year impacts due to an additional CMO school for 
math and reading respectively. 
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Figure IV.6. Distribution of Differential Test Score Effect Sizes Due to an Additional CMO School 
After Two Years in Math 



Source: State, district, and CMO school records. 

Note: “Significant” is defined as statistically significant at the .05 level, two- tailed test. 

Figure IV.7. Distribution of Differential Test Score Effect Sizes Due to an Additional CMO School 
After Two Years in Reading 



Differential Effect Size 


Source: State, district, and CMO school records. 

Note: “Significant” is defined as statistically significant at the .05 level, two- tailed test. 
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Nonetheless, despite the attenuation of reading impacts for many CMOs as they grow, it 
remains true that the larger CMOs in our sample more often have positive impacts, as noted above. 
In other words, successful CMOs may not fuUy sustain the impacts they produce in their first 
schools, but even after their impacts decline with growth, they tend to remain positive. 

10. CMO Impacts Do Not Generally Differ by Subgroup, But Several CMOs Have Larger 

Two-Year Math and Reading Impacts for Hispanic Students 

Prompted by prior studies of charter schools (Angrist et al. 2011; Gleason et al. 2010) that have 
found suggestive evidence of greater benefits for low-income minority students in urban areas, we 
examined whether two-year math and reading impacts were different for particular subgroups of 
students. The subgroups we focused on were Hispanic students, African American students, low- 
achieving students^* at baseline, students receiving free and reduced-price lunch, and male students. 
We were not able to examine all of these subgroups across our sample of 22 CMOs either because 
of data restrictions (we did not have free and reduced-price lunch information for all our 
jurisdictions) or because the student population in a particular CMO’s analysis sample was too 
homogenous.*^ Appendix I contains the impact estimates for each of these subgroups. 

There is some evidence of larger two-year math and reading impacts for Hispanic students. 
Figure IV. 8 shows the difference in the magnitude of two-year math impacts between Hispanic 
students and aU other students in each of the nine CMOs where we were able to estimate subgroup 
impacts for Hispanics. We found larger math impacts for Hispanic students for five out of six 
CMOs with statistically significant differential impacts. We found similar results for Hispanic 
students in reading. Other subgroups did not show clear patterns of differential impacts, positively 
or negatively, in reading or in math. 

E. Conclusion 

On average, CMO middle schools have effects on their students’ test scores that are marginally 
positive but not statistically distinguishable from the effects of other public schools nearby. The 
average impacts obscure large variations in impacts for individual CMOs. Most of the CMOs in our 
sample have impacts that are statistically significant in all measured academic subjects, either 
positively or negatively. After two years of enrollment, more CMOs have significant positive impacts 
than have significant negative impacts. The differences between high-performing and low- 
performing CMOs after two years of enrollment are large enough to be equivalent to a year or more 
of learning. 


We define low-achieving students as those who had a baseline test score below the 50th percentile of students in 
the relevant district or state. 

We require between 15 and 85 percent of students in our analysis sample to be part of a subgroup of interest 
(that is, Hispanic, African American, low achieving, students receiving FRPL, or male) before that particular subgroup is 
eligible for analysis. 
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Figure IV.8. Distribution of Differential Test Score Effect Sizes for Hispanic Students Relative to 
Effects for All Other Students After Two Years in Math 



Source: State, district, and CMO school records. 

Note: “Significant” is defined as statistically significant at the .05 level, two- tailed test. 


The largest positive impacts observed are in math, but within CMOs, impacts are correlated 
across subjects. CMOs with positive impacts in one subject tend to have positive impacts in other 
subjects, and we find no evidence that CMOs with positive impacts in reading and math have 
focused on those to the exclusion of lower-stakes subjects such as science and social studies. There 
is some evidence that CMO expansion can diminish student impacts in reading, but nevertheless, 
large CMOs in our sample more frequently have positive impacts than do small CMOs. Finally, 
several CMOs appear to have larger impacts for Hispanics relative to their impacts on other 
students, but we find little evidence that effects differ for other groups of students, defined in terms 
of gender, prior achievement, or income (as measured by free or reduced-price lunch eligibility). 
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V. STRUCTURES AND PRACTICES ASSOCIATED WITH STUDENT IMPACTS 


Key Findings 

• Among CMOs, school-wide behavior policies and intensive coaching of new 
teachers are positively associated with student impacts in both math and 
reading. 

• At the CMO level, we do not find impacts to be associated with use of a 
uniform curriculum, extended instructional hours, frequent formative student 
assessment, or performance-based compensation. 

• Intensive teacher coaching in CMOs may increase student achievement in part 
by increasing the frequency with which teachers modify their lesson plans 
using the results of student assessments. 


A. Introduction 

Might the variation in the practices of CMOs explain why CMO impacts vary substantially? To 
inform policy and practice, we examine the associations between CMO practices and impacts, with 
the aim of identifying practices that are associated with larger positive achievement effects on 
students. We also explore whether other factors, including CMO size and growth and state charter 
policies, are associated with impacts. 

The results in this chapter are best considered exploratory. Most of the analysis is based on 
bivariate associations between a single CMO-level practice and student impacts in math or reading. 
Although we also conduct some multivariate regression analyses of the relationship between impacts 
and several practices, multicolUnearity and limited sample sizes impede our ability to parse out the 
importance of each practice based on its association with impacts. And any observed associations 
between practices and impacts could be driven by other, unmeasured factors that are correlated with 
both the practice and impacts. Hence it is not possible to make causal inferences on the basis of 
these analyses. 

The analysis is conducted at the CMO level, reflecting our primary interest in the policies and 
practices of CMOs rather than of individual schools. The analysis covers the six primary hypotheses, 
discussed in Chapter III, of associations between CMO practices and impacts. Appendix Table K.1 
shows CMO-specific values for each of these measures, along with estimated impacts and baseline 
test scores. We also explore a longer list of 43 secondary hypotheses that relate CMO policies and 
practices to student impacts. These 43 secondary hypotheses include proposed mediators of the 
association between our primary hypotheses measures and impacts, alternative measures of our 
primary hypotheses, and other practices not captured by our primary hypotheses. All of the primary 
hypotheses and nearly all of the secondary hypotheses were defined before conducting the impact 
analysis. We performed multiple comparison adjustments for our primary hypotheses to test the 
robusmess of our main results accounting for multiple tests. 
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Results for secondary hypotheses should be considered especially exploratory, given the large 
number of them. We provide complete results in Appendix Table K.2 and Appendix Table K.3 for 
all secondary hypotheses, but in the main text of this chapter, we focus predominantly on those 
secondary hypotheses can shed light on the findings related to primary hypotheses. For secondary 
hypotheses, we do not adjust for multiple comparisons. 

In the remainder of the chapter, we first describe the methods and data in more detail. We 
begin our discussion of the results by presenting the associations between each primary hypothesis 
measure and impacts in math and reading. Next we summarize how impacts are related to the main 
primary hypotheses. We then elaborate findings related to the three primary hypothesis measures 
that are significantly associated with student impacts: school-wide behavioral policies, intensive 
teacher coaching, and extended instructional time. The discussion of the teacher coaching findings 
also includes a summary of our analysis of the interrelationship among coaching, measures of 
instructional coherence, and impacts. We then summarize our findings related to a number of 
staffing practices, including the use of Teach For America (TFA) and Teaching Fellow teachers. 
Finally, we present average impacts for the categories of CMOs discussed in Chapter III, including 
those defined by the primary hypotheses and by the prescriptiveness of CMOs. The methods 
employed in the chapter are summarized in Appendix J and the detailed results from aU the analyses 
are included in Appendix K. 

B. Methods Overview 

Our primary results make use of bivariate ordinary least squares (OLS) models with robust 
standard errors to gauge the correlation between student impacts and CMO practices (see Appendix 
J for more detail). The outcome variables are the estimated two-year middle school impacts of the 
CMO on students’ achievement in reading and math. The independent variables are measures of our 
primary and secondary hypotheses, the majority of which are constructed from responses to the 
principal survey (N = 19 CMOs, 294 principals). For practices measured by the principal survey, the 
responses of CMO principals and those of principals in matched district comparison schools are 
differenced to construct a measure of disparity of practices between CMO schools and nearby 
district schools. For some secondary hypotheses we make use of measures constructed from 
responses to the CMO central office staff survey (N = 17 CMOs) and the teacher survey (N = 12 
CMOs, 384 teachers). For these items, there are no comparison measures from districts or district 
schools. 

Any significant bivariate associations between CMO practices and student impacts may be 
spurious. A practice that appears to be positively associated with impacts may simply be correlated 
with other practices that are the real drivers of student outcomes. In addition to bivariate 
associations, we conducted a multivariate analysis that includes any measures of primary hypotheses 
that were significantly associated with CMO impacts in bivariate models. Because our sample size is 
limited and coUinearity among practices inflates standard errors in multivariate models, we must also 
remain cautious in drawing conclusions that practices that are not significantly associated with 
impacts in multivariate models are necessarily not related to student impacts. To balance the 
concerns raised by both bivariate and multivariate models, we present the results of both for our 
primary hypotheses. 
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C. Overview of Primary Hypotheses 

Below we focus on the significant associations between CMOs’ impacts and two of our six 
primary hypotheses (see Table V.l). Comprehensive school-wide behavior policies and an emphasis 
on coaching new teachers are at least marginally significantly associated with positive impacts in 
both subjects. 

Table V.l. Correlations Between Six Primary CMO Practices and Impacts 

Math Reading 


Consistent educational approach 

-0.08 

-0.04 


(0.12) 

(0.06) 

Comprehensive behavior policy 

0.18** 

0.08* 


(0.05) 

(0.03) 

Emphasize formative assessment 

0.15 

0.07 


(0.12) 

(0.05) 

Emphasize intensive teacher coaching 

0.19* 

0.08'' 


(0.07) 

(0.04) 

Emphasize performance- based compensation 

-0.02 

0.01 


(0.10) 

(0.05) 

Instructional hours per year 

0.1 5* 

0.07 


(0.07) 

(0.04) 


Source; State, district, and CMO school records, and Principal Survey 

''Significantly different from zero at the .10 level, two- tailed test. 

"Significantly different from zero at the .05 level, two- tailed test. 

**Significantly different from zero at the .01 level, two- tailed test. 

We found no significant relationship between impacts and three of our primary hypotheses 
measures: consistent educational approach, use of formative assessment, or performance-based 
compensation. Consistent educational approach was measured by whether all schools in the CMO 
use a single curricular model, whether the CMO provides support for selecting books and other 
instructional materials, and whether the CMO is responsible for selecting curricula and instmctional 
materials. The lack of association between a centralized educational model and CMO impacts 
suggests that effective CMOs need not implement a common, centralized curriculum across all 
schools. The lack of association between performance-based teacher compensation and student 
impacts is consistent with some previous research (Fryer, 2011; Glazerman and Seifullah, 2010). 

Frequent review of student test results by teachers, principals, and CMO central office staff is 
also not significantly associated with student impacts. By itself, frequent testing of students may not 
translate to gains in student achievement. As discussed later, student impacts do tend to be higher 
when teachers frequently revise their teaching plans in response to the results of assessment data, 
suggesting that student assessment data may be useful only to the extent that it is associated with 
changes in instructional practices. 

The marginally significant association between school-wide instructional hours and math 
impacts appears to be due to the fact that CMOs with more instructional hours also emphasize 
teacher coaching and school-wide behavior policies (see Appendix Table K.4). In the multivariate 
model, the magnitude of the associations between instructional hours and impacts is close to zero in 
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both subjects. In comparison to school-wide behavior policies and intensive teacher coaching, the 
association between instructional hours and student impacts is not robust. 

D. Findings 

1. Comprehensive Behavior Policies Are Positively Associated with Student Impacts 

Behavior policies have the potential to affect student achievement if they encourage students to 
focus, reduce the amount of disruptions, and increase time on task. As discussed in Chapter III, 
comprehensive behavior policies within schools were measured by an index that combines 
principals’ reports on five issues: (1) whether consistent behavioral standards and disciplinary 
policies are enforced, (2) whether schools have zero-tolerance policies for potentially dangerous 
behaviors, (3) whether schools have behavior codes with student rewards, (4) whether schools have 
behavior codes with student sanctions, and (5) whether the parent or student signs a responsibility 
agreement. There was substantial variation across CMOs in the extent to which CMO principals said 
their schools employed these policies. 

CMOs with comprehensive behavior policies in their schools tended to have more positive 
impacts on math and reading achievement. This association does not appear to be driven by the 
results for a select group of CMOs. Figure V.l plots CMOs’ estimated student impact in math on 
the y-axis against the average comprehensiveness of behavior policies in that CMO’s schools as 
compared to matched district comparison schools. Each point indicates a single CMO. The line of 
best fit is the linear regression line. There is a clear positive association between math impacts and 
comprehensive behavior policies across the full range of CMOs. The association between reading 
impacts and comprehensive behavior policies is not as strong, but again the results do not appear to 
be driven by outliers. 

The association between the implementation of comprehensive schoo-wideo behavior policies 
and student impacts is not driven by a single factor: each component of the composite is positively 
associated with impacts in both subjects (see Appendix Table K.5). Having school-wide zero- 
tolerance policies for potentially dangerous behaviors and having a behavior code with student 
sanctions are positively and at least marginally significantly associated with student impacts in both 
math and reading. 

Although impacts are associated with the presence of school-wide behavior policies, they do 
not appear to be associated with whether the CMO centrally sets student disciplinary policies and 
provides a system of rewards and punishments for student behavior (see Appendix Table K.2). 
Thus, it is the comprehensiveness of behavior policies within schools, not the extent to which 
CMOs set these policies, that may be related to more favorable outcomes. 
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Figure V.1. Comprehensive Behavior Policy vs. Math Impacts 



Source: State, district, and CMO school records and Principal Survey. 

2. Intensive Teacher Coaching Is Positively Associated with Student Impacts 

Intensive coaching of teachers has the potential to increase student achievement by increasing 
the quality of instruction. Coaching may be particularly important for new teachers who are stiU 
developing their teaching skills and practices. The intensity of coaching for new teachers is captured 
with an index that measures the frequency with which new teachers (1) are observed by coaches, (2) 
are observed by principals or other administrators, (3) receive feedback from observers, and (4) must 
submit lesson plans for review. 

More intensive teacher coaching is associated with more favorable student outcomes in both 
math and reading, although in reading the association is only marginally significant (p=0.07). Each 
component of the composite measure is positively associated with student impacts in both subjects 
(see Appendix Table K.6). As with behavior policies, the association is consistent across the range of 
math impacts and practices (see Figure V.2), and the results are similar in reading. 
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Figure V.2. Intensive Teacher Coaching vs. Math Impacts 



Source: State, district, and CMO school records and Principal Survey. 

Caution is merited in interpreting these results. When we include in a single model all three 
primary measures that are significantly associated with impacts — comprehensive school- wide 
behavior policies, intensive teacher coaching, and instructional hours — the effect of coaching is no 
longer statistically significant. Nonetheless, the magnitude of the association between teacher 
coaching and impacts still is non-trivial (the coefficient declines by only about a third) and our ability 
to isolate the independent contribution of coaching is limited by the small samples. 

Several of the secondary hypotheses related to teacher coaching are also associated with 
impacts. The professional development and coaching resources provided by CMOs and the staff in 
their central office are consistently positively associated with impacts. Principals’ reports that the 
CMO (or district) provides professional development support were positively and significantly 
associated with student impacts in both subjects. Additionally, the frequency with which CMO staff 
meet one-on-one with principals is positively and significantly associated with student outcomes in 
both math and reading. 

If coaching of teachers improves student outcomes, we might expect it does so by improving 
instructional practice. Indeed, in chapter 3 we noted that the extent to which teachers are observed 
appears to be correlated with the frequency with which they modify their lesson plans using student 
assessments (which is one of the components of our instructional coherence measure). And the 
frequency with which teachers modify lesson plans is positively associated with impacts in both 
subjects and mediates the effect of coaching on impacts in math (see Appendix Tables K.7 and K.8). 
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These results are consistent with the interpretation that teacher coaching improves student 
outcomes through altering teachers’ instructional plans and encouraging them to make greater use of 
individual student assessments. 

3. CMOs Using TFA and Teaching Fellow Teachers Have Higher Impacts, But Other 

Staffing Decisions Are Not Associated with Impacts 

In addition to coaching teachers, CMOs must make many other decisions about how to staff 
their central office and affiliated schools. They must determine the number of office staff and 
teachers to hire as well as how to select and compensate these staff. These decisions may affect 
several important factors including the skills teachers initially bring with them to schools, class sizes, 
and other kinds of support CMOs need to provide schools. We examined whether various staffing 
decisions are associated with math and reading impacts. 

Math impacts are higher among CMOs that rely more heavily on TFA and the Teaching 
Fellows programs as sources of new teachers. Specifically there is a statistically significant 
association between math impacts and the percentage of new teachers from these two sources, both 
of which tend to recruit and provide some training to recent graduates of highly selective colleges. 
One should be cautious about placing substantial weight on this finding because this is one of the 
many secondary hypotheses tested and the positive association could be due to random chance. 
However, it is possible that some promising CMO practices — such as adoption of school-wide 
behavior policies or longer instructional hours — are easier to implement in schools making use of 
TFA and Teaching Fellow teachers. 

We find litde evidence that other CMO hiring, staffing, or compensation practices are 
associated with student outcomes. CMOs’ strategies of compensation for principals and teachers, 
including the use of performance-based compensation, are not associated with student impacts (see 
Appendix Table K.2). Impacts are not significandy associated with the weight given during hiring to 
the teacher’s sample teaching performance or her commitment to the school’s mission. There is also 
no evidence that student impacts are associated with whether teachers have opportunides for tenure. 
Similarly, the allocadon of students to teachers is in general not associated with impacts, including 
instrucdon of students of similar ability together and teacher looping over grades, and the teacher- 
student rado. Although we find a negadve associadon between the rado of CMO central office staff 
to teachers in both math and reading, these associadons appear to be because of the results for two 
oudier CMOs and hence may be due to chance. 

4. CMOs Categorized As “Data-Driven” and “Time On Task” Have Larger Impacts, on 

Average, than Two Other Categories of CMOs 

The associadons between CMO practices and student impacts may not be independent. CMOs 
may achieve more favorable student outcomes when they implement a package of complementary 
practices designed to improve student learning. In Chapter III, we identified four clusters of CMOs 
that implemented distinct groups of practices. CMOs classified as Time on Task emphasize school- 
wide behavior policies, lengthened instructional hours, and more intensive teacher coaching, 
whereas CMOs in the Data-driven group emphasize frequent formative assessments, performance- 
based compensation, and intensive teacher coaching. CMOs in the Incremental Innovation groups 
are similar to district schools in their policies, and Alternative Approach CMOs are least likely to 
implement each of the practices in our primary hypotheses. 
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Variation in impacts across these four clusters is significant in both subjects, based on results 
from an F-test for homogeneity of impacts across clusters. On average, CMOs in the Data-driven 
and Time on Task clusters achieve the most favorable outcomes for students in both subjects. In 
math. Time on Task CMOs have the highest average impacts, whereas in reading the largest impacts 
are from the Data-driven CMOs. Average impacts are positive and significant for the Data-driven 
cluster in both subjects, and positive but only significant in math for the Time on Task cluster. By 
contrast, the Incremental Innovation group has average impacts close to zero, which is perhaps 
unsurprising, given their similarity to district schools. The Alternative Approach group has 
significandy negative average impacts in both subjects. 

Although average impacts among the clusters can be distinguished, there is also considerable 
variation in impacts within some of the clusters. In Figure V.3, each dot represents the estimated 
impact for a single CMO, with CMOs grouped by cluster along the x-axis. The variation is 
particularly large in the Time on Task cluster, which includes CMOs with impacts ranging from less 
than 0 to more than 0.6 of a standard deviation. 

The higher average impacts for the Time on Task and Data-Driven groups do not point 
towards any other promising practices aside from behavior policy and coaching. The relatively large 
impacts of the Time on Task cluster appear to be partly due to the emphasis these CMOs place on 
teacher coaching and behavior policy, but does not appear to be attributable to their longer 
instructional hours, a practice that is not associated with impacts in any of our multi-variate 
regressions including those that interact instmctional time with other practices. The relatively large 
impacts of the Data-Driven cluster are also partially explained by the emphasis of these CMOs on 
teacher coaching. We find no evidence that the favorable impacts for this cluster are due to their 
emphasis on either formative assessments or performance compensation, since these practices do 
not appear to be associated with impacts (see Table V.l); moreover, we find no evidence of an 
interaction effect (or synergy) between coaching and either performance compensation or formative 
assessment (see Appendix Table K.10). However, even after taking into account their use of teacher 
coaching, the Data-Driven CMOs have somewhat higher impacts than expected. Therefore, some 
unmeasured attributes or practices of Data-Driven CMOs may be partly responsible for their large 
positive impacts. 

5. Tightness of CMO Management Is Weakly Associated with Impacts 

In addition to variation in our core practice measures, CMOs exhibit considerable variation in 
the prescriptiveness of their management style. In Chapter III, three domains were identified in 
which CMOs might exhibit either a tight or loose management style: behavior policy, evaluation and 
compensation, and instructional approach. Based on their prescriptiveness in each of the three 
domains, CMOs were classified into four groups: Tight (all domains). Tight Evaluation and 
Compensation (loose instmctional approach and behavior policy). Tight Instructional Approach 
(loose behavior policy and evaluation and compensation), and Loose (all areas). 
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Figure V.3. Math Impacts by Core Clusters 



0.8 






0.6 



♦ 


•M 

U 

CL 




♦ 


E 

0.4 


♦ 

♦ 


•M 


♦ 




2 


♦ 




o 




♦ 


_c 






u 

LO 

0.2 





CD 



♦ 



■a 

■a 



♦ 

♦ 








L_ 

CD 

>- 

0 

♦ 






♦ 


6 


♦ 




g 


♦ 





-0.2 

♦ 



♦ 



♦ 



♦ 



Incremental 

Data- driven 

Time on Task 

Alternative 


-0.4 

Innovation 



Approach 


Source: State, district, and CMO school records and Principal Survey. 


Compared to the core clusters discussed in the previous section, the tightness of CMO 
management is only weakly associated with impacts. In reading, the variation in impacts across all 
prescriptiveness groups is not significant at the five percent level. In math, the variation across 
groups is significant, but the variation in impacts within groups is large (see Figure V.4). 

In both math and reading, average impacts are higher for the two groups that pursue a hybrid 
tight-loose approach than for the groups that are either tight in all domains or loose in all domains. 
However, few groups have average impacts that are significandy different from zero. Overall, CMO 
management style appears to be weakly associated with impacts. 
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Figure V.4. Math Impacts by Prescriptiveness Groups 
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Source; State, district, and CMO school records and Principal Survey. 
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VI. QUESTIONS FOR FUTURE RESEARCH 

As is often the case in studies of this kind, some of the interesting findings raise other 
important questions. Here we discuss a few that future studies might address, including some that 
will be explored in upcoming reports of this study. 

To what extent do CMOs produce positive effects on other student outcomes aside 
from academic achievement as measured by test scores? Although most of the CMOs in the 
study are focused on increasing academic achievement as measured by state student assessments, 
this is by no means their only objective. For example, nearly all CMOs have a long-term goal to 
prepare students for college. And most seek to cultivate students’ broad intellectual, social, and 
emotional development. Thus it is important to evaluate CMOs not only on how much they 
improve students’ academic test scores but also on the extent to which they achieve these other 
objectives. Some of these objectives are very difficult to measure empirically, but others simply 
require good longitudinal data over a longer period of time. A subsequent version of this report will 
explore the extent to which some CMOs affect two long-term outcomes: (1) high school graduation, 
and (2) college enrollment. 

Are some CMOs selecting the wrong models to replicate or having difficulty replicating 
promising school models? The development of CMOs was intended to scale up the most 
promising charter schools. But over 40 percent of the CMOs covered by our analysis are falling 
short of the performance of nearby district schools in math or reading. This raises questions about 
the models CMOs choose to replicate and the ways in which they implement them. For example, 
some CMOs may be scaling up the wrong models. Or perhaps they originally identified a promising 
model but have had difficulty replicating it. (In addition, as noted above, some CMOs may also be 
focused on other outcomes aside from increasing academic achievement as measured by test scores). 

The larger CMOs in our study have somewhat larger impacts on average than the smaller ones, 
which suggests that some of them have succeeded in replicating promising models. These CMOs 
have attracted sufficient amounts of funding and families to expand. Perhaps their funders and 
families have succeeded in confirming that the schools managed by these CMOs are effective. 

But we also found evidence that several CMOs have become less effective as they have grown, 
at least in terms of their impacts on reading skills, which declined for eight CMOs as they grew and 
rose for only one CMO. (In math, by contrast, most CMOs’ impacts did not change appreciably as 
they grew.) These findings suggest that CMO expansion can pose challenges. Our case studies 
suggest that CMOs encounter a number of issues as they grow, including difficulties finding teachers 
with the same skills as those in their first schools, providing principals with the right mix of direction 
and flexibility, and finding new facilities. 

Questions remain about how CMOs should seek to overcome these hurdles. For example, how 
can CMOs recruit and train staff effectively and provide them with effective incentives, guidance, 
and other forms of support? There are also questions about whether and how conventional public 
schools and school districts can replicate promising CMO strategies. Districts may encounter 
challenges implementing these strategies that are quite different from those faced by CMOs. A first 
step in helping others decide whether and how to adopt any CMO strategies is to describe them in 
more detail, an issue we discuss below. 
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Which promising strategies should CMOs and school districts implement, how should 
they implement them, and to what extent must specific strategies be bundled together to be 
effective? Our findings on promising practices are all tentative because they are based on 
correlations of practices and impacts. Even correlations that are statistically significant could be 
spurious. Nonetheless these findings point to important issues that can be explored in the future. 

Researchers could examine specific student behavior strategies to identify those that are 
effective in promoting better behavior and higher student achievement, and look for ways to 
implement them in more schools. A quarter-century ago, James Coleman and Thomas Hoffer (1987) 
argued that effective schools created communities in which behavioral expectations reinforced 
academic expectations. Our data indicate that the policies associated with CMO impacts include 
those that provide sanctions and rewards for specific student behaviors and, to a lesser extent, 
agreements with parents and students. Among the issues that should be explored are the best ways 
to specify and enforce sanctions and rewards and to train staff to implement them. 

Frequent teacher coaching is also associated with positive impacts. Presumably the content and 
intensity of coaching depend in part on a school’s educational and behavior policies. Previous 
experimental studies suggest that coaching programs do not necessarily pay off in the first year but 
sometimes do after two years of coaching individual teachers (Glazerman et al. 2010). Future 
research could explore what form of coaching is most effective. 

To what extent are some CMOs able to consistently improve student outcomes across 
their schools? One aim of CMOs is to improve the consistency of positive student impacts across 
multiple schools. This suggests that they should be evaluated not only based on the average impact 
across their schools but also based on the consistency of those impacts. In the future, we hope to 
explore this issue by estimating school-level impacts, examining the variability of impacts within 
CMOs, and assessing whether any practices are associated with both high CMO-wide impacts and 
greater consistency of impacts among a CMO’s schools. 

To what extent do CMOs add value compared to independent charter schools? CMOs 
seek to take advantage of both the autonomy associated with charter status and the scale implicit in a 
larger organization, which could provide benefits in several areas, including curriculum development, 
teacher training, and various administrative tasks. Whether CMOs can take advantage of scale 
without losing the benefits of charter status (that is, becoming indistinguishable from school 
districts) is a key question. Our study was not designed to precisely measure the extent to which 
CMOs improve student outcomes relative to independent charter schools. Indeed, most of our data 
collection and analysis focus on how CMOs perform relative to the most common nearby 
alternative — regular district schools. However, in four large districts we also estimated how CMO 
schools performed relative to independent charter schools in terms of their contribution to students’ 
academic achievement. Some CMOs did better than independent charters and others did worse. 
These analyses are insufficiendy comprehensive to estimate overall how CMOs perform relative to 
independent charter schools. Moreover, we did not collect information on how the practices of 
CMOs compare with those of independent charter schools, so we could not ascertain how 
differences in practices might be related to their relative performance. These important issues should 
be explored in the future. 

Are the newest generation of CMOs using the same strategies and producing the same 
impacts as the established CMOs in our study? Many more CMOs are operating today than 
could be included in our study. Since our study began four years ago, we focused on CMOs with 
four schools as of fall 2007. Hence many newer CMOs did not meet our criteria for inclusion. Some 
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of these — for example, those in New Orleans — have arisen in response to specific needs and 
contexts that differ substantially from those encountered by the older CMOs. And some may have 
had opportunities to learn from the experiences of the older CMOs. It remains to be seen whether 
these new CMOs are more or less effective than the older ones. 

What other factors might contribute to CMO impacts? A number of other differences 
between CMOs and districts, or among different CMOs, may contribute to CMO impacts. Some of 
these are difficult to observe or model and were not analyzed in this report. Other factors were 
measured but they did not show sufficient variation across CMOs to detect whether they 
contributed to impacts. Among the hypotheses we could not fully test in this study are ways in 
which impacts are related to high expectations in the classroom, funding, peer effects, grade 
configuration, and specific approaches to classroom instruction. We discuss these factors below. 

First, positive impacts might be channeled through high expectations for student 
achievement — sometimes described as a “no excuses” approach — as manifested in the intensity of 
instruction in the classroom and in systems that hold teachers and principals accountable. These are 
factors that some field research suggests are central in high-performing charter schools (see, for 
example, Angrist et al. 2011). However, surveys are not ideal for measuring whether a “no excuses” 
approach is in fact in use. Intensive observations of classroom practices and central office 
interventions could produce more knowledge about the importance of expectations. 

Second, differences in available funding — public and private, for operations and facilities — 
merit additional attention as a factor that might matter. Although we did not detect a relationship 
between funding and impacts, the tax forms of CMOs on which we had to rely are not likely to fully 
capture variation in resources available to different organizations. In addition, we could not develop 
comparable estimates of district and CMO per-pupil expenditures. An in-depth analysis of finances, 
ensuring comparability among CMOs and between CMOs and districts, could shed more light on 
the importance of resources. 

Third, because CMOs operate schools of choice, the families they attract are different in both 
measurable and unmeasurable ways, which may give rise to peer effects. The selection process of 
students is driven in part by who learns about and chooses to apply to CMO schools. It is possible 
that the parents or students who end up enrolling in some CMO schools are more motivated or 
have other assets. In addition, CMOs can encourage certain families to apply or enroll in their 
school; even those with random lotteries can target their recruitment efforts and ask students to sign 
agreements to attend regularly and do their homework. An individual student may benefit from 
being in the same school and classroom with other students with higher levels of motivation or 
parental support. If peer effects are contributing to CMO impacts, this does not mean that our 
impacts are improperly measured. Indeed, our experimental results suggest the impacts are accurate. 
But it could affect our understanding of the mechanisms behind the impacts: Peer effects may 
explain why CMO students do better than they would have had they been placed in a school or 
classroom where there are fewer students like themselves. If that turns out to be tme, it would also 
have important implications for policy: Similar effects might not be achieved, for example, if CMO 
practices were directly applied to conventional public schools that are not schools of choice. While 
peer effects can be challenging to estimate, future research should explore their importance. 

Fourth, charter schools often employ grade configurations that are less common in 
conventional public schools, including K-8 and K-12 configurations. Some research suggests that 
schools using longer grade configurations that eliminate the elementary-to-middle school transition 
produce better outcomes for their students (Jacob and Rockoff, 2011). A study of charter schools in 
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]/I. Questions for Future Research 


Chicago found substantial positive effects of charter schools on educational attainment, and noted 
that this might have been attributable to the fact that many of the schools had eliminated the 
transition from middle to high school (using grade configurations such as K-12 or 6-12) (Booker et 
al. 2011). Our study was not able to take a close look at the effects of varying grade configurations, 
because the absence of pre-entry baseline test scores precluded the examination of all schools 
beginning in kindergarten. In the future, historical data covering longer periods should at least 
permit the examination of CMO schools that eliminate the transition from middle school to high 
school. 

Finally, our data provided only limited understanding of the classroom dynamics within high- 
performing CMOs. CMOs with large positive impacts must be doing something in the interaction 
between teacher and student that leads to such impacts. As previously mentioned, we do not know 
exactly how many teachers succeed in implementing a “no excuses” approach or exactly what this 
entails. We do not have much information on the types of assignments and homework assigned, or 
the specific curricular programs teachers are using in math, reading, and other subjects. We do not 
know the details of the strategies teachers employ to engage students and manage their classrooms. 
Follow-up research involving intensive data collection in classrooms could help answer these 
questions. 
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APPENDIX A 


CONSTRUCTION AND ANALYSIS OF MEASURES USED IN CHAPTER III 

Several of our measures of CMO practices are composite variables. These composites were 
created by combining closely related survey items into a single measure, reducing measurement error 
and capturing the breadth of a construct. The composite variables and the items they each combine 
are listed in Table A.l. 

The process for creating these composite measures included a number of steps to maximize 
reliability and reduce dimensionality. We first identified all the items from the surveys that were 
conceptually related to a specific construct. Next we standardized the values for each of the items 
such that the overall mean for both CMO and comparison schools had a value of 0 and a standard 
deviation of 1.’ We used principal components analysis to confirm that the composite was 
unidimensional, excluding items that were not related to the underlying construct. We then 
computed the standardized Cronbach's alpha, an estimate of the internal consistency or reliability of 
a composite measure, and rejected composites with alphas smaller than 0.6.^ All of the composites 
passed these tests. 

To measure the CMO characteristics described in Chapter III, we chose to rely primarily on 
the principal survey, rather than on our central office staff survey or teacher survey, for several 
reasons. First, relative to the central office survey, the principal survey reflects the perceptions of 
respondents who are closer to the implementation of CMO practices in schools. Second, because we 
surveyed the principals of both CMO and nearby district schools, we could compare the responses 
of these two groups to gauge how CMO schools are distinctive. Moreover, the analysis of how 
impacts are related to practices described in chapter V, makes use of our measures of the differences 
in practices between each CMO school and its matched district school. Finally, the sample of 36 
CMOs (including 221 responding CMO principals and 171 responding district principals of 292 
matched pairs) covered by the principal survey is larger than the sample of 23 CMOs covered by the 
teacher survey, which also included only CMO teachers. Thus, while teacher respondents may be 
more attuned to the implementation of CMO practices in classrooms, relying on the principal survey 
enabled us to measure practices in a substantially larger sample of CMOs, as well as to draw 
contrasts with district schools. 

Our unit of analysis is the CMO.^ Some of our measures incorporate survey questions that ask 
explicitly about policies and activities of the CMO central office. However, we also rely on principal 
survey responses about school activities to infer CMO-level characteristics, averaging principal 


* We summed the items in each composite at the school level and restandardized the composite values after 
aggregation to the CMO level, such that each overall mean across CMOs and district comparison school groups had a 
value of 0 and a standard deviation of 1 . 

2 Conventionally, alphas greater than 0.7 are considered reliable, but because some of our composites had a small 
number of items, we accepted composites with alphas slightly smaller than this threshold. 

^ The unit of analysis for the comparison sample corresponds to the group of district schools matched with the 
schools within a CMO. Therefore, for CMOs with schools located in more than one district, this comparison unit 
aggregates responses from multiple districts, weighted by the number of CMO schools located in each district. 
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responses within each CMO and within the group of comparison schools associated with that CMO 
using weights to adjust for nonresponse. 

Of 292 CMO principals eligible for the study, 76 percent responded to the survey. Among the 
292 matched comparison principals, the response rate was 59 percent. Five CMOs eligible for the 
study declined to participate in the survey. 

Among participating CMOs, weights to adjust for principal nonresponse were created within 
each CMO and within each corresponding group of matched schools. Specifically, we created 
weighting cells within the CMO and district groups based upon CMO, school district, and a three- 
level indicator of whether the school was (1) a middle or high school, (2) an elementary school, or 
(3) contained elementary, middle, and high school grades (such as K-12). Schools that included 
elementary and middle grades were considered “elementary” for this indicator. Adjustments were 
calculated by summing the total sample of eligible schools within the weighting cell and dividing by 
the total of these weights for respondents only. 

The intraclass correlation coefficients for our primary measures of CMO practices indicate that 
a substantial portion of the total variation we observe in CMO school-level practices across our 
primary measures is due to variation across CMOs (Table A.2). As such, we assume that these 
school-based measures capture the emphasis the CMO central office places on a specific practice or 
some characteristic of the CMO, even if the CMO is not explicitly promoting the strategy in its 
schools. To estimate the intraclass correlation coefficients we began with the following random 
effects model: 

Yij = p + a; + Sij, 

where i indexes group (CMOs); j indexes principals within each CMO; p is an unobserved overall 
mean; aj is an unobserved random effect shared by all values in group I; and s^j is an unobserved 
error term. 




Table A.l. Survey Items Used in Composite Measures of Primary Practices 


Composite Measure 


Principal Survey Items Included 

Consistent educational approach, including 
curriculum and instructional materials 

1. 

2. 

All schools in CMO (district) must use singular 
instructional model 

CMO (district) provides support selecting some 
books/instructional materials 


3. 

CMO (district) responsible for selecting 
curricula/instructional materials 

Student behavior policies that include specific 
rewards, sanctions, and commitments 

1. 

School enforces consistent behavioral 
standards and disciplinary policy 


2. 

School has zero tolerance policy for potentially 
dangerous behaviors 


3. 

School has behavior code with student rewards 


4. 

School has behavior code with student 
sanctions 


5. 

Parent or student required to sign 
responsibility agreement 

Frequent review and analysis of student formative 
assessment data 

1. 

How often diagnostic test results are given to 
new teachers 


2. 

Frequency with which teachers meet to analyze 
student test score data 


3. 

Frequency with which principal meets with 
CMO (district) central office to review test 
scores 

Intensive teacher coaching and monitoring 

1. 

Frequency with which new teachers are 
observed by coaches 


2. 

Frequency with which new teachers are 
observed by principals/administrators 


3. 

Frequency with which new teachers receive 
feedback from observers 


4. 

Frequency with which new teachers must 
submit lesson plans for review 

Performance- based teacher evaluation and 
compensation 

1. 

Relative importance of student test scores to 
teacher education and seniority/tenure in 
determining teacher pay 


2. 

Relative importance of administrator 
assessments to teacher education and 
seniority/tenure in determining teacher pay 


3. 

Use of student test scores to evaluate teachers 
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Table A.2. Intraclass Correlation Coefficients for Primary Measures 


Primary Practice 

ICC 

Consistent educational approach, including 
curriculum and instructional materials 

0.51 

Student behavior policies that include specific 
rewards, sanctions, and commitments 

0.30 

Frequent review and analysis of student formative 
assessment data 

0.28 

Intensive teacher coaching and monitoring 

0.30 

Performance- based teacher evaluation and 
compensation 

0.40 

Amount of instructional time 

0.64 
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APPENDIX B 


VALIDATION OF IMPACT ESTIMATION APPROACH 

I. Introduction 

One of the central goals of this study is to rigorously assess the impacts of CMOs on student 
academic outcomes using non-experimental methods. This appendix describes how we examined 
whether a non-experimental, panel-based (NXP) research design using a propensity-score matching 
(PSM) approach can produce impact estimates that correspond to those produced by randomized 
experimental methods — the “gold standard” method for causal inference. 

This validation effort, conducted for a subset of CMO schools in which experimental analyses 
could also be conducted, finds that PSM produces impact estimates that are very similar to the 
benchmark experimental impact estimates in middle and high schools."* In both reading and math, 
PSM estimates differ from experimental estimates by 0.01 to 0.03 standard deviation units. Site- 
specific estimates across seven lottery sites indicate that PSM results correlate with experimental 
results at levels of 0.9 or higher. The differences between the PSM and experimental estimates are 
not statistically significant. As a sensitivity check, two alternate NXP approaches, exact matching 
(EM) and ordinary least-squares (OLS) regression, also produced impact estimates similar to the 
experimental method. These results provide evidence supporting the rigor of nationwide NXP 
impact estimates for the larger set of CMO schools where experimental methods are not feasible. 

Elementary schools were not included in this validation, because most students start at an 
elementary school in kindergarten or first grade, meaning there are no pre-entry baseline test scores 
that can be included in the NXP analysis. Pre-entry test scores are likely to be critical in the 
replication effort (Cook et ah, 2008). The typical state testing pattern also means that we have no 
achievement outcomes to measure until three or four years after kindergarten entry lotteries (as 
standardized tests are not administered until spring of second or third grade); we were able to 
identify only three schools with lottery records far enough in the past to allow us to observe these 
test scores, and the sample size was too small to be reliable (N=118). Given these obstacles, the 
validation focuses on middle and high schools. 

This appendix begins by explaining the need for the validation, and how it allows the larger 
study to take advantage of the different strengths of both experimental and NXP methods. We then 
describe the data used in the analysis followed by three methodology sections that describe the 
validation methods, the experimental methods, and the NXP/PSM methods. Finally, we present 
detailed results of the validation analysis and conclusions. 

II. Why Validate? 

Properly designed and implemented randomized experiments produce impact estimates that 
support stronger causal conclusions than any other method, by ensuring that the treatment and 


'• This validation included high schools because we plan to estimate high school impacts on test scores; these 
impacts will be reported for a broader sample of high schools in a future report. 
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control groups are similar on observed and unobserved characteristics prior to receiving an 
intervention (Shadish et al., 2002; Murnane and Willet, 2011). Thus, any statistically significant 
difference between group outcomes can be attributed to the impact of the intervention. 

In the charter-school context, randomized experiments can be conducted using the admission 
lotteries that oversubscribed schools conduct (Gleason, et ah, 2010; Dobbie and Fryer, 2011; Angrist 
et al., 2010). However, not all charter schools are oversubscribed; not all oversubscribed schools use 
lotteries; and not all schools using lotteries keep good records of winners and losers. Researchers 
have found that admissions lotteries can be used for experimental analysis in only a small proportion 
of charter schools nationwide.^ Of the 161 CMO middle and high schools in the target sample for 
the overall study (that is, schools with entry grades of 4-9), adequate data for a rigorous experimental 
analysis was available for only 12 schools and only select grades and cohorts of those schools. 
Moreover, it would not be surprising if the (oversubscribed, well-organized) schools where 
experimental analysis is possible are systematically different from the schools where it is not possible 
(see Abdulkadiroglu et al., 2009 for suggestive evidence of this). In sum, admissions lotteries can be 
used to conduct experiments producing strong causal inferences about a small subset of CMO 
schools, but they cannot be used to examine the impacts of most CMO schools across the country. 

In contrast, student records data are available to conduct NXP analyses for a large proportion 
of all CMO schools nationwide. NXP methods for estimating CMO impacts rely on longitudinally- 
linked data on individual students before and after they enter CMO schools. Our preferred NXP 
approach, propensity-score matching,*^ involves comparing the achievement of CMO students to 
non-CMO students who have a similar estimated likelihood of enrolling in a CMO school (that is, 
similar baseline achievement and other characteristics). Although NXP methods lack the strong 
causal validity of randomized experiments, because matched students may differ on unobserved 
characteristics, they allow assessment of the impacts of many more schools and CMOs. 

Thus, in this study experimental and NXP methods involve a tradeoff between internal and 
external validity. An experimental analysis on a small number of schools and CMOs included would 
not be representative of typical CMOs and schools. A NXP analysis would not enable strong causal 
inference, as students who attend CMO schools could be different in unobserved ways from the 
PSM-identified comparison students. By conducting experimental and PSM analyses in a subset of 
schools, we can assess whether PSM produces results that match the experimental impact estimates 
for the same schools and the same students. If the PSM approach successfully replicates 
experimental impact estimates, the replication provides greater confidence that the PSM approach 
will produce valid impact estimates when applied to the nationwide population of CMO middle and 
high schools. 


5 One national study found that less than 15 percent of charter middle schools could be included in a lottery-based 
analysis (Tuttle, Gleason, and Clark, 2011). 

Three common NXP approaches were considered: PSM, EM and OLS regression. PSM was preferred over EM 
because some schools districts were too small to provide exact matches for many CMO students. PSM was preferred 
over OLS, because PSM creates a plausible counterfactual and thus can meet WWC standards. We describe the PSM 
approach in depth in Appendix C. 
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III. Data 


a. Description 

We use student-level administrative data provided by state departments of education, school 
districts, and CMOs. Our validation sample includes data from four jurisdictions. The treatment 
schools included 12 oversubscribed middle and high schools across seven CMOs in 2006-2007, 
2007-2008, or 2009-2010 academic years. Three of the schools had oversubscribed lotteries in 
multiple years. 

We focused on two outcomes: reading/EngUsh language arts and math scores^ on state 
achievement tests one year after students enrolled in school following participation in a lottery (year 
1). Where data were available, we standardized test scores using state-level means and standard 
deviations for each grade and cohort. Otherwise, we used district-level means and standard 
deviations for test score standardization. 

Student characteristics available for both the experimental and NXP analyses were: baseline 
reading and math test scores (including missing test score indicators), sex, race/ ethnicity (African- 
American, Hispanic, white/other), baseline free- or reduced-price lunch (FRPL) eligibility status,* 
English language learner’ (ELL) status, special education status (lEP), and an indicator of whether a 
student attended a charter school in the baseline year. The analyses also included indicator controls 
for the school students applied to, with groups of schools that share applicants combined into 
experimental “sites.” Each site has a common jurisdiction, lottery year, and grade level, and thus the 
site controls for these factors. 

b. Diversity of Validation CMOs 

The seven CMOs included in the validation exercise were quite diverse. The sample CMOs had 
schools in three of the four U.S. Census regions: Northeast, South, and West. The sample CMOs 
had 80 schools in total, including 25 middle schools eligible for this study’s primary (NXP) impact 
analysis. 

Table B.l provides baseline (pre-enrollment) student characteristics for middle schools in six 
validation CMOs (one of the validation CMOs only had high schools) and baseline characteristics 
for middle schools of the other 16 CMOs in the middle school analysis. The validation CMOs are 
diverse. Prior to enrolling in the validation CMO middle schools, standardized student test scores 
were as low as -.11 and 0.08 in reading and math respectively, and as high as .63 and .53. The 
validation CMOs had as litde as 5 percent of special education students with an individualized 
education plan and as much as 14 percent. The percentage of students that were English language 
learners varies from 4 percent to 33 percent in validation CMOs. Finally, the validation CMOs were 


^ In some states, the specific math tests taken in the 8th and 9th grade vary. Because the test taken could depend 
on whether the student enrolled in a CMO school, in these states we were not able to include students entering high 
school in the math analysis. 

® One district did not have reliable information on students’ free- or reduced-price lunch status. 

^ One district did not include information on students’ English language learner status. 
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diverse in terms of race/ethnicity: the percentage of African-American students ranged from 11 
percent to 81 percent and the percentage of Hispanic students ranged from 17 percent to 75 
percent. 


Table B.I. Baseline Statistics for Validation and Non- Validation CMO Middle Schools 


CMO 

Reading 

Score 

Math 

Score 

Percentage 

Black 

Percentage 

Hispanic 

1 

-0.08 

-0.1 1 

Medium 

Medium 

2 

-0.07 

0.04 

Low 

High 

3 

-0.07 

0.16 

Low 

High 

4 

0.02 

-0.04 

Medium 

Low 

5 

0.05 

-0.03 

High 

Low 

6 

0.63 

0.53 

Low 

Low 

7 

-0.62 

-0.36 

Low 

High 

8 

-0.46 

-0.46 

Medium 

Medium 

9 

-0.07 

-0.07 

Low 

Low 

10 

-0.04 

0.10 

Low 

High 

1 1 

0.00 

-0.04 

Low 

High 

12 

0.01 

-0.04 

Medium 

Medium 

13 

0.01 

0.02 

High 

Low 

14 

0.03 

-0.01 

High 

Low 

15 

0.04 

-0.16 

High 

Low 

16 

0.05 

-0.09 

High 

Low 

1 7 

0.12 

0.19 

Low 

High 

18 

0.20 

0.22 

Low 

High 

19 

0.24 

0.1 1 

Low 

High 

20 

0.27 

0.27 

Low 

Medium 

21 

0.35 

0.46 

Low 

Medium 

22 

0.89 

0.93 

Low 

Medium 


(1) Notes: Validation CMOs are shaded gray. Within validation and non-validation g roupings, 

CMOS are ordered first by reading score (lowest to hi ghest) and then math s core, if the 
percentage African-Ame rican or Latino students was between 0 and 33 percent the CMO is 
labeled low, CMOs with percentages between 34 and 67 pe rcent are labeled medium, and 
CMOs with percentages greater than 67 percent are labeled high. 

rV. Replication Procedure 

a. Replicating Experimental ITT Estimates 

Two common analyses in the charter school literature are estimating the effect of receiving an 
offer of admission to a charter schools (an intent-to-treat or ITT estimate) or the effect of attending 
a charter school (an effect of treatment-on- the-treated or TOT estimate). These estimates are not 
identical, because some lottery winners decline their offers, and some losers at the time of the lottery 
end up enrolling in the schools as a result of wait-list admissions or other post-lottery changes. Both 
estimates are of interest as a policy matter: the ITT impact is relevant in assessing the likely effects 
of making more CMO enrollment spaces available; and the TOT impact is relevant in assessing the 
effect that can be expected for a student who enrolls. These different analyses estimate impacts for 
different groups of students and make different assumptions. 
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We chose to use ITT estimates, because experimental ITT estimates make fewer assumptions 
than experimental TOT estimates and therefore have stronger causal validity. Specifically, the 
validity of the experimental TOT impact estimates depends on the assumption that crossover 
students — lottery losers who enroll in the CMO schools — experience the same impacts as lottery 
winners who would have enrolled in CMO schools regardless of whether they won the lottery.'® 
Since the assignment variable for our lotteries is winning a lottery at the time of the lottery (as 
discussed in more detail in section V), the ITT estimates include substantial crossover of control 
students into treatment. As crossover students receive offers later, it is possible that their enrollment 
in the CMO begins after the start of the school year, which could mean that crossover students 
experience a different treatment and have different impacts than the lottery winners. (Unfortunately, 
the data do not indicate how often this occurs.) Smaller actual impacts for crossover students would 
produce TOT estimates that are biased upward. As the goal of the experimental analysis is to 
provide the most causally valid estimates to use as a benchmark, we report ITT estimates. 

An experimental ITT estimate is, in basic form, simply the average outcome for treatment 
students minus the average outcome for control students. In the absence of crossover, replicating 
experimental impact estimates is therefore equivalent to replicating the experimental control group 
(given that the treatment group is defined to be the same in experimental and NXP analyses). When 
some control group students cross over to attend treatment schools (“contamination”), however, 
replication is more complicated. It is impossible for an NXP approach to replicate the part of the 
experimental control group that crosses over, because the NXP approach finds comparison students 
only among the population that did enroll in treatment schools. Nonetheless, an NXP approach 
can attempt to replicate the experimental ITT estimate by splitting the estimate into components 
corresponding to the different groups of students who actually receive treatment. The supplement to 
this appendix derives the following quasi-experimental estimate as the most equivalent to the 
experimental ITT: 

(2) NXP ITT estimate = P.^ x B.,. - Pj. x 

Where P.^ is treatment group’s CMO enrollment rate, B.j, is the CMO impact on treatment 
enroUees, P^^ is control group’s CMO enrollment rate (crossover rate), and B(^ is the CMO impact on 
control enroUees. 

This equation has the virtue of aUowing the effects of CMOs on the control crossover students 
to differ from the effects on lottery winners who attend the schools. To repUcate experimental ITT 
results non-experimentaUy, we subtract the estimated NXP effects on the crossover students from 
the effects on the lottery winners who attend the schools. P^- and P(^ are observed in the lottery data, 
and B.J, and B(, are separately estimated using PSM on the experimental treatment and control 
students who enroU in CMO schools." 


'0 In other words, among the students who would attend the CMO schools regardless of whether they won or lost 
the lottery (that is “always-takers”), impacts experienced by the lottery losers must be equivalent to impacts experienced 
by the lottery winners. 

Experimental impact estimates are limited not only to the subset of schools in which lottery-based analysis is 
possible, but also to the subset of cohorts and individual students who were randomly assigned to treatment via the 
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b. First-Year Achievement Impacts in Math and Reading 


Many possible experimental estimates could be replicated in this particular study since there are 
test scores in reading and math across 12 schools, six CMOs, seven lottery sites, four jurisdictions, 
and multiple grades and cohorts. To maximize statistical power and the precision of the estimates, 
we chose (in advance of estimation) to pool all lotteries across cohorts, grades, schools, and CMOs 
for the students’ first year in the school to permit the inclusion of the most-recent lotteries. This 
provided two overall impact estimates — one for reading and one for math. In addition to producing 
primary esdmates of impacts across sites, we also examine the correlation between first-year 
experimental and NXP estimates site-by-site. 

V. Experimental Methods 

a. Sample and Baseline Equivalence 

The experimental sample frame consists of students who applied to an oversubscribed CMO 
school that used a random lottery to admit students. The treatment group is composed of applicants 
offered admission to a participating CMO school at the time of the lottery. ^ Applicants not offered 
admission at the time of the lottery form the control group. All students who provided consent, 
were in the correct application grade at the time of the lottery, were randomized in the lottery, were 
in the proper grade for the lottery, and had baseline test scores were included in the analysis. 

Some students applied to more than one experimental CMO school, meaning they could 
receive an offer to one CMO school even if they lost lotteries at the other school(s). In these cases, 
we treated aU schools sharing applicants as a single site. Further restrictions were made at the site- 
level to ensure the validity and power of the impact estimates. Experimental sites had to meet each 
of the following criteria to be included in the analysis: 

1. The overall and differential attrition rates must be lower than the What Works 
Clearinghouse maximum thresholds (liberal attrition standard); 

2. If we did not observe the lottery and consequendy were unsure of the randomizadon 
validity, any difference between treatment and control average baseline test scores must 
be less than 0.25 effect size and demographic differences must be less than 25 
percentage points;'^ 


(continued) 

lotteries. The NXP treatment sample only included the enrollees from the experimental analysis, excluding students in 
other years, grades, and cohorts, along with students who did not participate in the randomized lottery. 

The CMO enrollment rates of students admitted at the time of the lottery are substantially higher than students 
rejected at the time of the lottery, meaning that this measure provides enough random assignment to fairly test CMO 
impacts. Angrist et al. (2010) used a similar approach. A traditional experimental approach where assignment is based on 
whether students were ever admitted to a CMO school was not possible for two reasons. First, many schools with 
oversubscribed lotteries ultimately admitted aU students who not admitted at the time of the lottery, meaning there were 
no randomized students who never received an offer (that is, no control students). Second, many schools did not follow 
the randomization order when admitting students after the lottery. 

The effect size measure for test was Hedge’s g. Relatively large baseline differences were allowed because some 
of the sites were small and could have moderate baseline differences even if the randomization was sound. 
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3. The difference in CMO school enrollment between treatment and control groups must 
be at least 20 percentage points. 

After these exclusions, there were 579 treatment and 809 control students with baseline data 
who were eligible for the reading analysis, and 331 treatment and 574 control students with baseline 
data who were eligible for the math analysis. In the reading impact analysis, we excluded 52 
treatment and 74 control students because we were unable to obtain outcome data or they were in 
the wrong grade in the outcome year,^^ leaving a final analysis sample size of 527 treatment and 735 
control students. In the math analysis, we excluded 13 treatment and 26 control students for the 
same reasons, leaving a final analysis sample size of 318 treatment and 548 control students. Overall 
attrition in the reading sample was 9 percent, with no differential attrition between the treatment and 
control conditions. Overall attrition in the math sample was 4 percent, with a 1 percentage point 
difference between attrition in the treatment and control conditions. The low attrition levels in this 
study are unlikely to significantly bias impact estimates, according to the What Works Clearinghouse 
attrition standards. 

Consistent with minimal bias, baseline statistics of observable characteristics indicate that the 
final treatment and control groups were very similar for both reading and math impact analyses (see 
Table B.l) with no statistically significant differences (all p-values>.10). Table B.2 presents baseline 
statistics for students included in the math and reading analysis samples. Table B.2 does not include 
imputed values, but values for baseline test scores and demographic characteristics were imputed in 
the analysis. For baseline and pre-basekne test scores, we included a missing data indicator and set 
each missing test score to the state or districtdevel mean, which is zero by design. For students 
missing demographic variables (race/ ethnicity, gender, FRPL, LEP, lEP, baseline charter status), we 
recoded the missing values for these covariates to the mode across all students in the sample (not an 
English language learner, no lEP, not attending a charter school at baseline, receiving free/reduced 
price lunch, female, and Hispanic). We did not impute outcome test scores. Students who were 
missing either a math or reading test score in the follow-up year were excluded from the analysis 
when that test score was the outcome variable. 


The math analysis also excludes 396 students applying to a high school because the math test taken in the ninth 
grade in some states could be affected by treatment (that is, in some states ninth-graders take a course-specific rather 
than a grade-specific exam, and the school may affect the course students take). 

These students without outcome test scores most likely attended a private school or an independent charter 
school that did not provide data to their district. The students in the wrong grade in the outcome year either repeated or 
skipped a grade in the outcome year. 
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Table B.2. Baseline Statistics for Treatment and Controi Groups 


Characteristic 

T Mean 

C Mean 

Diff (T-C) 

p-value 


Reading Analysis (n: 

= 527 T and 735 C) 


Baseline Reading Score 

.11 

.07 

.04 

0.497 

Baseline Math Score 

.08 

.02 

.06 

0.328 

Prebaseline Reading 

.16 

.08 

.08 

0.183 

Prebaseline Math 

.13 

.12 

.01 

0.949 

% Male 

.51 

.50 

.01 

0.653 

% Black 

.43 

.43 

.00 

0.926 

% Hispanic 

.49 

.51 

-.02 

0.648 

% White/Other 

.07 

.06 

.01 

0.425 

Baseline % FRPL 

.76 

.75 

.01 

0.850 

Baseline % LEP 

.09 

.10 

-.01 

0.859 

Baseline % iEP 

.06 

.08 

-.02 

0.313 

Baseline % Charter Attendance 

.09 

.1 1 

-.02 

0.465 


Math Analysis (n= 

318 T and 548 C) 


Baseline Reading Score 

.12 

.09 

.03 

0.696 

Baseline Math Score 

.08 

.02 

.06 

0.313 

Prebaseline Reading 

.14 

.09 

.05 

0.454 

Prebaseline Math 

.10 

.08 

.02 

0.817 

% Male 

.53 

.47 

.06 

0.128 

% Black 

.61 

.62 

-.01 

0.857 

% Hispanic 

.37 

.36 

.01 

0.823 

% White/Other 

.02 

.02 

.00 

0.883 

Baseline % FRPL 

.83 

.82 

.01 

0.907 

Baseline % LEP 

.06 

.05 

.01 

0.542 

Baseline % iEP 

.06 

.08 

-.02 

0.437 

Baseline % Charter Attendance 

.04 

.02 

.02 

0.208 


Notes: These tables include only students who we re included in the reading and math analyses. 

Students with missing reading outcome data are excluded from the reading analysis sample; 
likewise, students with missing math outcome d ata are excluded from the math anal ysis 
sample. Reading and math te st scores are standardized using the state district mean and 
standard deviation. All statistics are weighted to account for admission probabilities. 


98 


b. Experimental ITT Estimation and Weights 

To estimate an experimental intent- to-treat impact, we compared outcomes of applicants 
offered admission at the time of the lottery to those of applicants rejected at the time of the lottery, 
controlling for students’ previous test scores and demographic characteristics^'’ The impact 
estimation model is: 


(2) yj — oc -fXjP -f 6Tj -I- SjB -I- £j, 

where y is the reading or math test score outcome for student i; a is the intercept; X; is a vector 
of achievement and demographic characteristics (see Table IV. 1); Tj is a binary variable for 
treatment status, indicating whether student i was admitted at the admission lottery; S; is a vector of 
indicators identifying which site the student applied to; s is a random error term that reflects the 
influence of unobserved factors on the outcome; and p, 8, and 0 are vectors of parameters or 
parameters to be estimated. The estimated coefficient on treatment status, 8, represents the impact 
of admission to a CMO school at the time of the lottery. The model assumes that the treatment 
indicator and covariates influence the outcomes in the same way across all sites. To examine how 
CMO impacts varied by site, we also estimated a model that interacted the treatment indicator with 
each site indicator. The site-specific impact estimation model is: 

(3) Yi —OC. +Xj(B -f 6Tj -f Sj0 -I- (Tj x Sj)(p -I- £j, 

where all the terms are the same as in Equation 2 except for cp, the estimated coefficient on the 
interaction terms, which is a vector of estimated coefficients on the interaction terms. Students were 
weighted to account for admission probabilities. 

VI. Propensity-Score Matching Method 

In order to replicate the experimental ITT impact estimate, we require two sets of non- 
experimental impact estimates: a treatment group enroUees’ impact estimate and a control group 
enroUees’ (crossover) impact estimate (following Equation 1). The treatment group enroUees are 
lottery winners who attended an experimental CMO school. The control-group enroUees are lottery 
losers who attended an experimental CMO school. For both groups of enroUees, comparison groups 
are identified among students who did not attend an experimental CMO school. 

a. Estimating Propensity Scores 

The first step is to estimate a propensity score for each student in the sample. To determine the 
appropriate propensity score model for each of the two enrollee groups, we use a forward model 
selection procedure for the logistic regression. Because baseline math and reading test scores are 
some of the strongest predictors of later outcomes, we specify that the model-building procedure 
begins with the model containing the two baseline test scores and corresponding missing test score 
indicators. At each subsequent step, the forward procedure adds a term from a specified set of 


As student admission to CMO schools was randomly determined, we could simply compare the mean outcomes 
of the treatment and control groups. However, to obtain more precise impact estimates, we adjust for baseline student 
characteristics in a regression model. 
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potential covariates to optimize model fit to the data. The procedure could select from a list of 52 
potential covariates: the 11 observed baseline covariates, 39 two-way interactions of these covariates, 
and 2 interactions of test scores with themselves (i.e., quadratic terms). Table B.3 shows baseline 
covariates and interaction terms included in the final propensity models. These models fit the data 
well as indicated by the Hosmer and Lemeshow Goodness-of-Fit test />-values. The propensity-score 
method is described in more detail in Appendix C. 


Table B3. Covariates Included in the Final Propensity Score Models 



Treatment-G roup Enrollees 

Control-Group Enrollees (Crossovers) 


Baseline Math Test Score (MATH) And 

Baseline Math Test Score (math) And 


Corresponding Missing Indicator 
(missmath) 

Corresponding Missing Indicator 
(missmath) 


Baseline Reading Test Score (read) 
And Corresponding Missing Indicator 
(missread) 

Baseline Reading Test Score (read) 
And Corresponding Missing Indicator 
(missread) 


Sex 

Sex 


Free/Reduced Price Lunch (FRPL) 

Free/Reduced Price Lunch (FRPL) 

Baseline Covariates And 
Interaction Terms 

Special Education (iep) 

English Language Learner (ELL) 

Jurisdiction 

Jurisdiction 

Grade 

Grade 


missmath*sex 

math*read 


missmath*frpi 

math*ell 


read*frpl 

missmath*ell 



sex*ell 



math*Frpl 



math*site 



read*site 



sex*site 

Hosmer & Lemeshow 
Goodness-Of-Fit Test p- 
value 

0.45 

0.78 


b. Select Closely-Matched Comparison Students 

After estimating the propensity scores, we identified comparison students whose estimated 
propensity scores were similar to those of each treatment student (that is, comparison students who 
had similar probabilities of enrolling in CMO schools). The selection used caliper matching, whereby 
a given treatment student was matched to all comparison students with estimated propensity scores 
within a specified range (or caliper), rather than merely selecting a specified number of nearest 
neighbors. The sampling occurred with replacement. The matching procedure was implemented 
separately for each jurisdiction. To improve statistical precision, we selected multiple comparison 
students for each treatment student. 

For math outcome samples, the matched comparison students on average had similar math and 
reading test scores as the treatment students (Table B.4 and A. 5). They also had similar distributions 
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on all demographic covariates, with the exception of race/ethnicity and baseUne charter school 
attendance. The results were similar for reading outcome samples (not shown). 


Table B4. Baseline Statistics for Treatment- Group Enrollees (Math Outcome) 


Prior Student Achievement 
or Student Characteristic 

Treatment (n= 200) 
Mean/Percentage 

Comparison (n= 5,905) 
Mean/Percentage 

Difference 

Baseline Math Score 

0.10 

0.07 

0.02 

Baseline Reading Score 

0.10 

0.06 

0.04 

Race/Ethnicity 

African American 

0.58 

0.30 

0.28** 

Hispanic 

0.40 

0.48 

-0.08** 

White/other 

0.02 

0.22 

-0.20** 

Male 

0.60 

0.59 0.01 


Free/Reduced Price Lunch 

0.84 

0.87 

-0.03 

Special Education 

0.05 

0.05 0.00 


English Language Learner 

0.06 

0.1 1 

-0.05* 

Attended Charter School at 
Baseline 

0.05 

0.02 0.03'^* 


'^Significantly different from zero at the .1 0 level, two-taile d test 


'■'■Significantly different from zero at the .05 level, two-taile d test. 


Table B.5. Baseline Statistics for Control- Group Enrollees (Math Outcome) 


Prior Student Achievement 

Treatment (n= 1 24) 

Comparison (n=3,695) 


or Student Characteristic 

Mean/Percentage 

Mean/Percentage 

Difference 

Baseline Math Score 

-0.01 

0.03 

-0.03 

Baseline Reading Score 

0.05 

0.08 

-0.03 

Race/Ethnicity 
African American 

0.56 

0.32 

0.25** 

Hispanic 

0.42 

0.46 

-0.04** 

White/other 

0.02 

0.22 

-0.20** 

Male 

0.40 

0.40 0.00 


Free/Reduced Price Lunch 

0.89 

0.84 

0.05 

Special Education 

0.08 

0.10 

-0.02 

English Language Learner 

0.02 

0.02 

0.00 

Attended Charter School at 
Baseline 

0.01 

0.02 

-0.01 


*Significantly different from zero at the .1 0 level 
'■''Significantly different from zero at the .05 level 


There were only a few students who were not African-American or Hispanic in the treatment group. While the 
race/ethnicity variable was selected by the model selection procedure, the associated coefficients had large standard 
errors. As a result, race/ ethnicity were excluded from the propensity score matching model, resulting in an imbalance 
between treatment and matched comparison students. However, race/ethnicity was a covariate in the impact estimation 
model. In addition, the PS estimates are almost identical to those from the exact matching approach which included 
race/ ethnicity as a matching characteristic. This suggests that our approach was robust to the exclusion of this variable 
from the propensity model. Similar problems occurred with baseline charter school attendance. 
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c. Impact Model 

Following the creation of matched samples, we estimated impacts using an OLS regression 
model; covariates were included to improve statistical precision and to control for any remaining 
differences in baseline characteristics. The regression model was identical to the model used in ITT 
experimental analysis — Equations 2 and 3. Here, however, the treatment indicator, T, corresponds 
to each of the two enroUee groups (treatment-group enroUees and control-group enroUees) defined 
at the beginning of this section. The parameter of interest in Equation (4) is p, which corresponds to 
the impact estimate. 

In estimating impacts, enrolled students were weighted to account for the probability of 
winning a lottery admission offer. The matched comparison students are assigned the analysis 
weight for the enrolled students to whom they are matched. The experimental weights were rescaled 
so that a given site has the same weight in both the experimental and the PSM approaches. This 
weighting ensures that any potential differences between experimental and PSM estimated impacts 
can be attributed to the approaches themselves, rather than differences in weights. 

d. Alternative NXP Approaches: Exact Matching and OLS Regression 

As noted, two alternate NXP approaches were used to test the sensitivity of results to the 
particular NXP approach. These alternate approaches are exact matching (EM) and ordinary least 
squares regression (OLS) without matching. 

Exact matching uses comparison group students who exacdy match treatment students on a set 
of demographic characteristics and have very similar baseline test scores (CREDO, 2009). To be 
selected, the comparison students had to exactly match the treatment students on the following 
categorical characteristics: baseline charter school attendance, sex, race/ ethnicity, FRPL eligibility 
status, English language learner status, individualized education program status (special education), 
grade in outcome year, cohort, and jurisdiction. Exact matching on continuous characteristics — such 
as baseline math and reading test scores — would rarely identify matches, so we define a comparison 
student to be an exact match if his or her test score falls within 0.10 standard deviations of the 
treatment student’s baseline test score in the same subject. Following the creation of the matched 
comparison group, impacts were estimated using the same regression model used in the 
experimental and PSM analyses. 

The OLS-only approach does not attempt to create a matched comparison group of students. 
Instead, the approach uses the entire population of non-CMO students in the local jurisdiction as 
comparisons, relying entirely on covariates to adjust for baseline differences between treatment 
students and other students. The same OLS regression model used to estimate impacts in aU of the 
other approaches was used. 

VII. Results and Conclusions 

The PSM approach successfully replicates average experimental ITT impact estimates for the 12 
CMO schools in the replication sample. ITT impact estimates for the two approaches are reported 
in Table B.6. PSM produces ITT impact estimates within 0.01 of the experimental ITT estimate in 
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math and 0.03 of the experimental ITT estimate in reading. Neither of the differences is statistically 
significant, and they differ in opposite directions (that is they do not consistently under- or over- 
estimate impacts). In short, we find no evidence that PSM produces biased impact estimates.^® The 
last two rows of Table VII. 1 show that exact matching and OLS, like PSM, produce impact 
estimates that are very close to experimental impact estimates. 


Table B.6. Experimental and NXP Impact Estimates in 12 CMO Schools 



Math 

Reading 

Impact 

SE 

Impact 

SE 

Experimental ITT 

0.09** 

0.04 

0.00 

0.04 

PSM ITT (baseline NXP approach) 

0.08 

0.07 

0.03 

0.06 

Alternate NXP Estimates 

ITT Based on Exact Matching (EM) 

0.08 

0.07 

0.02 

0.06 

ITT Based on OLS 

0.07 

0.07 

0.02 

0.06 


**indicates statistically distinguishable from zero with 95 percent confidence. 


Moreover, NXP impact estimates at the site level are very similar to experimental site estimates, 
as the high correlations in Table B.7 indicate. At the site level, PSM ITT impact estimates correlate 
with experimental ITT impact estimates at 0.97 in math and 0.90 in reading. Site-level results are also 
very similar to experimental impacts for exact matching and OLS approaches. 


Table B.7. Experimental and NXP Impact Estimates at the Site Level 


Lottery Site 


Reading 



Math 

EXP 

PSM 

EM 

OLS 

EXP 

PSM 

EM OLS 

A 

0.31 

0.36 

0.38 

0.34 

0.05 

0.12 

0.08 0.10 

B 

n/a 

n/a 

n/a 

n/a 

0.14 

0.17 

0.16 0.13 

C 

-0.1 1 

-0.08 

-0.1 1 

-0.13 

-0.15 

-0.21 

-0.25 -0.18 

D 

0.05 

0.03 

0.06 

0.03 

o 

p 

-0.04 

-0.01 -0.03 

E 

0.19 

0.14 

0.1 1 

0.14 

0.1 1 

0.1 1 

0.11 0.08 

F 

-0.03 

-0.08 

-0.07 

-0.06 

-0.06 

-0.1 1 

-0.08 -0.09 

G 

n/a 

n/a 

n/a 

n/a 

-0.10 

-0.01 

-0.04 -0.002 

Correlation with EXP 

n/a 

0.97 

0.96 

0.99 

n/a 

0.90 

0.90 0.88 


** Standard errors for NXP estimates are rough. They assume zero covariance among site-specific impact 
estimators and between impacts on treatment and control enrollees. 
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In sum, this validation analysis suggests that propensity-score matching with baseUne test scores 
is capable of producing impact estimates that are not only unbiased but that also replicate 
experimental impact estimates with a high degree of precision. 

Supplement: NXP Derivation of Experimental ITT Estimate 

Notation: 

jU^= mean outcome of always-takers in experimental group k, where k G [T, C} 
mean outcome of compilers in experimental group k, where k G {T, C} 

= mean outcome of never- takers in experimental group k, where k G {T, C} 
p"^= proportion of lottery participants who are always-takers (is equivalent between T and C) 
p‘"’’= proportion of lottery participants who are compilers (is equivalent between T and C) 
p^= proportion of lottery participants who are never-takers (is equivalent between T and C) 
pf = proportion of experimental group k that enrolled in CMOs 
Note that pj=p^+ p^^ and Pc = p^. 

Not all of the [x parameters can be estimated. In fact, only pj and observable. However, 

we can always estimate the p parameters because the percentage of always-takers, compilers, and 
never-takers in each treatment group is the same due to randomization, and p^ and p^ are observed 
enabling the calculation of p*". 

Experimental ITT Impacts: 

(1) = (p'^Pt + P^^ Pt^ + p’^ P t) ~ (p^Pc + Pc^ + Pc) 

= p\p^ - p^) + p^^ip^/ - pgo + p^ip^ - P^c) 

If we assume that the exclusion restriction — that is assignment only affects outcomes through 
attendance — holds for never-takers who do not attend CMO schools, but not for always-takers who 
may receive different treatment at CMO schools depending on assignment, then pj — Pq = 0, so 

(!’) = p\p^ - p^) + p^^ip^/ - pF) 

NXP Impacts: 

In the NXP, every CMO enroUee from the experimental treatment group is matched with an 
observationally similar student from a district school. 

CMO enrollees from the experimental treatment group have the following mean outcome, which we 
denote by pf : 

+-P^,PP 

k-T pA+pCP Pt + pA+pCP Pt . 

which is just a weighted average of the always-taker mean and the compiler mean in the 
experimental treatment group, with weights equal to their proportional representation among CMO 
enrollees in the experimental treatment group, pf is observed but p^: and pj^: are not observed 
because always-takers and compilers cannot be distinguished. 
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Consider the matched comparison sample in the NXP analysis. Let [Xq denote the mean 
outcome in the matched comparison sample, with the subscript “D” connoting “district”. 

The TOT impact in the NXP is: 

(2) T0T<2™ = 

What is [Xq? The matched comparison sample consists of: (1) district students who are matched 
to treatment group always-takers, and (2) district students who are matched to treatment group 
compilers. Since we cannot distinguish always-takers and compilers in the treatment group, we also 
cannot distinguish the two subgroups of the matched comparison sample. Nevertheless, [Xq can be 
expressed as a weighted average of the unobservable mean outcomes of the two aforementioned 
subgroups: 

u - f ..MA , ..MC\ 

(3) (^p^+pCP + pA+pCP Md J 

Where is the mean outcome for district students matched to treatment group always- 
takers, and is mean outcome for district students matched to treatment group compilers. 

Substituting (3) into (2) and rearranging terms gives: 

A CP 

(4) roT<2™ {liV - ti'S"). 

If CMO schools have an impact, then ^ because the outcome of the CMO enrollees in 
the experimental control group will reflect that impact but the matched district students’ outcome 
will not. However, if the district students who are matched to treatment group compilers are an 
excellent proxy for the experimental control group compilers (more reasonable since experimental 
control group compilers do not attend CMO schools), then which means: 

A CP 

(4-) (iij - ^r) + 

Since the NXP goal is to get as close to the experimental ITT as possible, multiply (4’) by 
(p*+p^*^ to eliminate the denominators on the right hand side of equation (4’). Moreover, as noted, 
pf = . Therefore, 

(5) pf X = p^(p^ - - pg^). 

This is very close to the experimental ITT expression in equation (T). The only difference is 
that equation (5) has (p^ ~ Pd"^) instead of (p^ — Pc)- As noted, this is a substantial difference 
because the mean outcome of always-takers in the experimental control group, p^, reflects the 
contribution of CMOs while the mean outcome of district students who are matched to treatment 
group always-takers, does not reflect any contribution of CMOs since these students did not 
enroll in CMOs. 
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However, let us conduct another matching exercise in which CMO enroUees in the 
experimental control group (that is, the control group always-takers) are matched with district 
students who are similar on observed characteristics. Let TOJ^^ denote the difference in outcomes 
between CMO enrollees in the experimental control group and their matched counterparts in the 
district. In the best-case scenario, the district students who are matched to CMO enrollees in the 
experimental control group have the same mean outcome as district students who are matched to 
always-takers in the experimental treatment group. This assumption is reasonable because both 
groups of district students are being matched to always-takers from the lottery and randomization 
means the always-takers should be equivalent at baseline. Under this assumption: 

(6) TOT^^ = lie- leads to: 

(7) X TOT^^ = pKlii - li^) = P^lii - mD- 
Finally, subtracting equation (7) from equation (5), gives: 

(8) pf X -Pc^ TOT^^ = p^(p^ - - pf) - p^(p^ - 

= p\p^-pi)+p^^(pr-pn 

_I^jvjpEXP 

Therefore, pf X (TOT^^^') — Pc should be similar to the experimental ITT impacts. 

Note that the better the matches created by the NXP matching process, the more likely the equality 
in equation 8. 
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APPENDIX C 


PROPENSITY SCORE MATCHING METHODOLOGY 

To obtain a matched comparison group, we used a propensity score matching (PSM) procedure 
(Rosenbaum and Rubin 1983). While we used a uniform approach across CMOs to estimate 
impacts, our main impact analyses were conducted separately for each CMO. This appendix 
describes our approach to estimating propensity scores, matching treatment and comparison 
students, estimating CMO-specific impacts, and conducting sensitivity analyses. 

1. Estimating Propensity Score 

The first step for the PSM approach is to estimate a propensity score for each student in the 
sample. The potential comparison students, however, are more diverse in some of their baseline 
characteristics than the treatment students. To improve the fit of the propensity score model, we 
conducted a common support check that excluded from the sample any potential comparison 
students who were outside of the range of the treatment students’ baseline characteristics prior to 
selecting the propensity model. For example, if baseline math test scores for treatment students in a 
given CMO were between -1.7 and 1.9, only potential comparison students with test scores within 
this range were kept. Similarly, if aU treatment students in a given CMO were classified as non- 
English language learner (ELL) students, the potential comparison group sample was restricted to 
non-ELL students as well. Covariates included in the common support check were baseline math 
and reading test scores, indicators for missing baseline test scores, sex, race/ ethnicity, free- or 
reduced-priced lunch (FRPL) status, disability (lEP) status, English language learner (ELL) status, 
whether a student attended a charter school in the baseline year, baseline grade, cohort, and district. 

A propensity score model was then developed using an automated stepwise model selection 
procedure for the logistic regression in SAS 9.1. Because baseline math and reading test scores are 
some of the strongest predictors of later test scores (Glazerman, Levy, and Myers 2003; Cook, 
Shadish, and Wong 2008), we specified that the model-building procedure begin with the model 
containing baseline math and reading test scores (measured in the grade prior to the CMO middle 
school’s entry grade) and corresponding missing test score indicators. At each subsequent step, the 
stepwise procedure then either adds or subtracts a term from a specified set of potential covariates 
to optimize model fit to the data. The hst of potential covariates included sex, race/ ethnicity, FRPL 
status, lEP status, ELL status, whether a student attended a charter school in the baseline year, 
baseline grade, cohort, district,*^ any two-way interactions of these covariates (including those with 
baseline test scores and corresponding missing test score indicators), and two interactions of test 
scores with themselves (i.e., quadratic terms). 


We did not use pre-baseline (two grades prior to CMO entry) test scores as potential covariates because they 
were missing for a large proportion of students in our sample. However, we used them as control variables in the impact 
model. 
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Propensity model selection was often an iterative process. The Hosmer and Lemeshow (H-L) 
Goodness-of-Fit test^^’ was used, among other indicators, to determine whether the model selected 
by the model selection procedure fits the data well and whether to proceed with selecting a matched 
sample. If there were indications of bad model fit (e.g., small H-L p-value or large standard errors 
for some of the parameters included in the model), we diagnosed and corrected the problems. Most 
of the issues were resolved by detecting and collapsing sparse cells. For example, if the treatment 
group had very few Asian students resulting in high standard errors for the Asian parameter in the 
propensity model, we collapsed Asian with another race/ ethnicity category, such as White/Other. 
We then re-ran the model selection procedure with the collapsed categories and again checked the 
model fit. We used the final propensity model to estimate the propensity scores for each treatment 
and comparison student in our sample for that CMO. 

2. Selecting a Matched Comparison Group 

After estimating the propensity scores, the next step involved selecting a matched comparison 
group whose estimated propensity scores are similar to those of treatment group students. To 
improve statistical precision, we selected multiple matches for each treatment student. To ensure the 
quality of the matches and reduce bias, we matched with replacement (allowing each comparison 
student to match more than one CMO student) and implemented caliper matching, whereby a given 
treatment student is matched to all comparison students with estimated propensity scores within a 
specified range (caliper), rather than merely selecting a specified number of nearest neighbors. We 
specified five calipers, ranging from 10"^ to 10"^. Starting with the smallest caliper, we checked for 
matches. If a treatment student had between 2 and 30 potential matches, all of these comparison 
students were identified as the matches for the given treatment student. If the number of potential 
matches exceeded 30, we identified the 30 comparison students with the closest propensity score 
(that is, the best-matched students) as the matches to this treatment student. If we did not find at 
least two matches, we increased the caliper to the next level and tried again. The matching procedure 
was implemented separately for each grade, cohort, and district combination (we refer to these 
combinations as the matching strata). 

To ensure that we included as many students as possible and to increase the likelihood of a 
successful match, a given treatment (or comparison) student did not have to have test scores for all 
outcomes of interest to be included in the sample to be matched. However, this could have resulted 
in unbalanced treatment and comparison group samples for some outcomes due to differences in 
the samples of students across outcomes. To ensure that our treatment and comparison samples 
were balanced for all outcomes of interest, we created outcome-specific analysis weights. In 
particular, for a given outcome, each treatment student with a valid outcome of interest and at least 
one matched comparison student was assigned a weight of 1. The matched comparison students 
(with a valid outcome of interest) were assigned the analysis weight for the treatment students to 
whom they were matched. In other words, a treatment student and his or her matched 
comparison(s) had to have the same weighted representation within the treatment and comparison 
samples, respectively. A given treatment student could potentially have many matched comparison 


The H-L Goodness-of-Fit statistic is constructed by first dividing the observations into deciles based on their 
predicted probabilities and then calculating the chi-square statistic testing whether the distributions of predicted and 
actual frequencies across deciles are the same. Smaller y>-values indicate worse model fits. 
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students; when this happens, the treatment student’s weight is divided into even-weight shares 
among the matched comparison students, so that collectively the matched comparison students have 
the same weight as the treatment student. Since aU matches are done with replacement, a given 
comparison student could also be matched to many treatment students. In these instances, the 
comparison student’s analysis weight equals the sum of the weights (or weight shares) for aU 
treatment students to whom he or she was matched. Any treatment student with a missing value for 
the outcome or with no matched comparison students with an observed outcome was assigned a 
weight of zero. Similarly, any comparison student who was matched to a treatment student with a 
missing outcome was assigned a zero weight as well. Since we matched students separately within 
each matching stratum, this procedure for calculating weights also ensured that treatment and 
comparison students were weighted proportionally within a given grade, cohort, and district 
combination. Finally, we rescaled the weights to add up to the total number of effective students in 
the analysis sample (i.e. double the number of treatment students with a valid outcome and who also 
has at least one matched comparison student with a valid outcome) for this outcome. 

Using this procedure, we achieved equivalence between treatment and comparison groups on 
baseline test scores and most of the observed key covariates (see Appendix D for more detail). This 
level of baseline equivalence meets the internal validity standards of the What Works Clearinghouse. 
Furthermore, for each CMO, we were able to match between 64 percent and 100 percent of CMO 
enroUees in the primary intake grade who had a valid outcome, with a match rate of at least 90 
percent for most of the test scores that we rely on as key outcomes, enhancing the external validity 
of the analysis.^’ 

3. Estimating CMO-Specific Impacts 

Following the creation of matched samples and analyses weights, we employed a regression 
model to improve statistical precision and to control for any remaining differences in baseline 
characteristics. Our CMO-specific impact regression model is: 


Yis +^iP + SsTi + VjO + £j. 


where yj^ is the subject s test score outcome one, two or three years after initial school 
enrollment for student /; a is the intercept; X; is a vector of achievement (pre-baseline and baseline 
test scores in reading and math, and corresponding missing test score indicators) and demographic 
(race/ ethnicity, FRPL status, lEP status, and ELL status^^) characteristics; T; is a binary variable for 
treatment status, indicating whether student / enrolled in one of the study CMO schools; V; is a 
vector of indicators identifying which grade, cohort and district the student belonged to; s is a 
random error term that reflects the influence of unobserved factors on the outcome; and a, p, 8^, 
and 0 are vectors of parameters or parameters to be estimated. The estimated coefficient on 
treatment status, represents the one, two or three-year impact in subject s of attending a study 
CMO school. We used robust standard errors that adjust for clustering of students within schools 
and also implemented the analyses weights described in the preceding section. 

Match rates were 90 percent or better for 1 7 of the 22 CMOs on two-year reading and math outcomes, for 8 of 
the 1 1 CMOs on the three-year science outcome, and for 6 of the 9 CMOs on the three-year social studies outcome. 

Not all student variables are available in all jurisdictions. 
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4. Sensitivity Analyses 


As mentioned before, if aU covariates that are related to a student’s probability of enrolling in a 
CMO school and that are also related to the outcome of interest are observed and appropriately 
accounted for, then the PSM approach could, in theory, result in an unbiased estimator of the 
impact of attending a CMO school. As described previously, since a large proportion of students 
were missing pre-baseline test scores, we did not use pre-baseline scores in the propensity model 
(though we did use them in the impact regression). However, pre-baseline test scores could be 
related to either a student’s probability of enrolling in a CMO school or the outcome of interest in a 
way that is not fully accounted for by baseline test scores or any of the other matching covariates 
selected by the stepwise model selection procedure. Furthermore, students who are residing in areas 
far away from CMO schools may be different from students who reside closer to CMO schools. For 
example, these students could have attended better (or worse) elementary schools or they could be 
residing in an area with more (or less) resources (e.g., after-hours educational programs or better 
libraries), which would make them better (or less) prepared to succeed in middle school. If so, not 
accounting for the area of residence or the type of school that a student attended prior to enrolling 
in a CMO school (or a comparison middle school) could introduce bias in the estimates. 

Therefore, we explored these possibilities by conducting sensitivity analyses in four CMOs. 
Under this alternative approach, we re-matched students following the procedure outlined above 
with two modifications. First, we restricted our potential comparison group to students who at 
baseline attended one of the schools that the treatment students attended before enrolling in the 
CMO schools (i.e. the feeder schools). Second, along with baseline test scores, we included pre- 
baseline math and reading test scores and corresponding missing test score indicators in the stepwise 
propensity model selection procedure as required covariates. As expected, due to these restrictions 
the potential pool of comparison students was much smaller and we were able to successfully match 
fewer treatment students. In particular, for one of the CMOs, we were only able to match 123 (or 
30%) of the original 409 successfully matched treatment students. Even so, the CMO-specific 
impacts estimates were generally quite similar to our benchmark estimates: on average the impact 
estimates changed by -0.015 and the change never exceeded 0.05 (Table C.l). We concluded that any 
bias from omitting these variables from the match is negligible, and that our primary method is 
preferred because it allows us to include many more students. 


110 



DRAFT 


Table C.l. Main CMO- Specific Impact Estimates Compared to Resuits from the Sensitivity Analyses 


CMO 


Main Results 




Results of the Sensitivity Analyses 


N. 


Impact 

SE 

p-v alue 

N. 

Nc 

Impact 

SE 

p-v alue 

2- Year Math 

D 

837/142 

10,177/963 

-0.12 

0.05 

0.01 

581/104 

5,723/407 

-0.1 1 

0.04 

<0.01 

E 

269/27 

5,331/223 

0.05 

0.1 1 

0.66 

211/26 

1,820/103 

0.10 

0.10 

0.30 

J 

628/20 

1 1,191/221 

0.09 

0.03 

<0.01 

566/20 

7,379/97 

0.08 

0.03 

<0.01 

L 

409/8 

2,801/309 

-0.02 

0.02 

0.35 

123/4 

285/52 

0.01 

0.08 

0.89 

2- 

Year Reading 










D 

853/145 

10,374/970 

-0.10 

0.04 

0.01 

595/106 

5,842/414 

-0.10 

0.04 

<0.01 

E 

269/27 

5,327/223 

-0.13 

0.04 

<0.01 

212/26 

1,837/103 

-0.12 

0.03 

<0.01 

J 

627/20 

1 1,164/221 

0.18 

0.02 

<0.01 

565/20 

7,388/98 

0.19 

0.02 

<0.01 

L 

409/8 

2,830/310 

-0.10 

0.02 

<0.01 

123/4 

289/52 

-0.08 

0.05 

0.10 

Notes; 

(1) The sensitivity analyses 

restricted the potential 

comparison 

group to students 

attending feeder sch ools and controlled 

for pre- 


baselines test scores in the propensity model. (2) N^=s ample sizes for treatment group. N^=sample sizes for matche d comparison 
group. The first number indicates the number of students, while the second number indicate s the number of schools t hese students 
attended in year 2. (3) Standard errors are adjusted for clustering at the school level. 
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APPENDIX D 


BASELINE EQUIVALENCE 

As presented in Chapter Four, the study’s impact analyses used propensity score matching to 
identify groups of comparison students that are similar to CMO students. Tables D.l (for second 
year math impacts) and D.2 (for second year reading impacts) present the average baseline 
characteristics of students in each CMO and its associated comparison group. The tables also report 
the percentage of eligible CMO students who were matched successfully — any CMO students who 
could not be matched were not included in the impact sample. 

For the study’s second-year math impacts, the matching procedure identified a group of 
comparison students with very similar characteristics to CMO students. Out of the 22 CMOs, 
baseline math scores were not significantly different in 20 cases and baseline reading scores were not 
significantly different in 21 cases (Table D.l). For the two CMOs that did have significant 
differences in baseline scores, the difference in mean z-scores was below .15 in both subjects. None 
of the matched comparison groups had significant differences in gender, free and reduced price 
lunch (FRPL) status, or limited English proficiency (LEP) status. Out of 21 CMOs with data on 
individualized education plans (lEP), there were no significant differences in 20 cases. For student 
racial categories (not shown), there were no significant differences in 21 out of 22 cases. Match rates 
for all 22 CMOs were above 70 percent, and for 20 CMOs the study matched more than 80 percent 
of all eligible CMO students. 

The baseline equivalence results are very similar for the second-year reading impacts (Table 
D.2). Baseline scores in the matched comparison group were statistically indistinguishable from the 
CMO group in 20 cases for math and 21 cases for reading; for the two CMOs with significant 
differences, the difference in mean baseline z-scores was less than .15 in both subjects. None of the 
matched comparison groups showed significant differences in gender, FRPL status, or LEP status. 
For lEP status, 20 of the 21 CMOs with data did not have a significant difference compared to the 
matched student group. For student racial categories (not shown), there were no significant 
differences in 21 out of 22 cases. Also, the match rates for students at the 22 CMOs were above 70 
percent in all cases, and in 20 cases the study matched more than 80 percent of all eligible CMO 
students. 

The baseline equivalence results and match rates associated with the other outcome years and 
test subjects presented in this report were similar to the figures shown below. Full baseline 
equivalence and match-rate tables for these other outcomes are available upon request to the study 
authors. 
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Table D.I. Baseline Equivalence of Student Characteristics, Second Year Math Sample 


CMO 

Math 

Reading 

Female 

FRPL Status 

lEP Status 

LEP Status 

Match 

CMO 

Comp. 

CMO 

Comp. 

CMO 

Comp. 

CMO 

Comp. 

CMO 

Comp. 

CMO 

Comp. 

Rate % 

N 

-0.33 

-0.27 

-0.50 

-0.45 

0.52 

0.48 

N/A 

N/A 

0.03 

0.02 

0.65 

0.66 

72 

C 

0.00 

-0.01 

0.01 

-0.01 

0.47 

0.47 

1.00 

1.00 

0.10 

0.09 

0.00 

0.00 

78 

H 

0.44 

0.36 

0.52 

0.46 

0.52 

0.47 

N/A 

N/A 

0.09 

0.08 

0.10 

0.1 1 

81 

E 

0.43 

0.46 

0.34 

0.34 

0.40 

0.45 

0.37 

0.37 

0.02 

0.02 

N/A 

N/A 

81 

D 

-0.43 

-0.37 

-0.25 

-0.22 

0.50 

0.53 

0.83 

0.83 

0.03 

0.02 

N/A 

N/A 

88 

Q 

-0.24 

-0.22 

-0.17 

-0.1 7 

0.46 

0.50 

0.55 

0.57 

0.13 

0.10 

N/A 

N/A 

90 

T 

-0.03 

0.01 

-0.14 

-0.09 

0.50 

0.51 

0.91 

0.90 

0.10 

0.08 

0.37 

0.38 

91 

L 

0.1 1 

0.23* 

0.26 

0.38* 

0.41 

0.44 

N/A 

N/A 

0.08 

0.09 

N/A 

N/A 

93 

P 

-0.02 

0.01 

0.05 

0.05 

0.42 

0.41 

0.81 

0.81 

0.08 

0.08 

N/A 

N/A 

96 

M 

-0.02 

0.00 

0.02 

0.03 

0.49 

0.48 

N/A 

N/A 

0.1 1 

0.09* 

0.09 

0.09 

96 

F 

-0.09 

-0.08 

0.01 

0.03 

0.50 

0.50 

N/A 

N/A 

0.1 1 

0.09 

0.03 

0.03 

98 

K 

0.1 7 

0.14 

0.00 

-0.03 

0.53 

0.53 

N/A 

N/A 

0.19 

0.20 

0.42 

0.40 

98 

J 

0.29 

0.30 

0.26 

0.25 

0.47 

0.48 

0.82 

0.81 

0.03 

0.03 

N/A 

N/A 

99 

C 

-0.1 1 

-0.14 

0.09 

0.08 

0.45 

0.45 

0.66 

0.67 

0.07 

0.06 

0.00 

0.00 

99 

0 

-0.05 

-0.06 

0.07 

0.06 

0.41 

0.42 

N/A 

N/A 

0.1 5 

0.16 

0.02 

0.03 

99 

U 

-0.1 1 

-0.06 

-0.18 

-0.12 

0.49 

0.50 

0.90 

0.91 

0.06 

0.06 

0.30 

0.29 

>99 

A 

0.86 

0.89 

0.87 

0.93 

0.42 

0.43 

N/A 

N/A 

0.04 

0.05 

N/A 

N/A 

>99 

R 

0.29 

0.33 

0.33 

0.33 

0.43 

0.43 

0.81 

0.83 

N/A 

N/A 

N/A 

N/A 

>99 

1 

-0.09 

-0.02* 

-0.04 

0.00 

0.54 

0.55 

0.88 

0.87 

0.02 

0.02 

N/A 

N/A 

>99 

B 

-0.01 

-0.02 

-0.01 

-0.01 

0.48 

0.50 

0.87 

0.88 

0.14 

0.13 

0.33 

0.33 

>99 

V 

-0.05 

-0.01 

0.01 

0.08 

0.54 

0.49 

0.83 

0.85 

0.14 

0.14 

0.06 

0.07 

>99 

S 

0.24 

0.25 

0.21 

0.23 

0.46 

0.47 

0.91 

0.90 

0.01 

0.01 

N/A 

N/A 

>99 


* Difference between the CMO and comparison student group is statistically significant at the five percent level. 
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Table D.2. Baseline Equivalence of Student Characteristics, Second Year Reading Sampie 


CMO 

Math 

Reading 

Femaie 

FRPL Status 

iEP Status 

LEP Status 

Match 

CMO 

Comp. 

CMO 

Comp. 

CMO 

Comp. 

CMO 

Comp. 

CMO 

Comp. 

CMO 

Comp. 

Rate % 

N 

-0.34 

-0.24 

-0.50 

-0.44 

0.52 

0.47 

N/A 

N/A 

0.03 

0.02 

0.65 

0.66 

72 

C 

0.00 

-0.01 

0.01 

-0.01 

0.47 

0.47 

1.00 

1.00 

0.10 

0.09 

0.00 

0.00 

78 

E 

0.43 

0.46 

0.35 

0.35 

0.40 

0.45 

0.37 

0.37 

0.01 

0.02 

N/A 

N/A 

81 

H 

0.45 

0.43 

0.53 

0.50 

0.51 

0.49 

N/A 

N/A 

0.09 

0.08 

0.10 

0.10 

82 

D 

-0.45 

-0.41 

-0.27 

-0.26 

0.50 

0.53 

0.83 

0.83 

0.03 

0.02 

N/A 

N/A 

88 

Q 

-0.23 

-0.21 

-0.1 7 

-0.16 

0.46 

0.50 

0.55 

0.57 

0.13 

0.10 

N/A 

N/A 

90 

T 

-0.02 

0.02 

-0.12 

-0.08 

0.50 

0.51 

0.91 

0.89 

0.10 

0.08 

0.37 

0.38 

91 

L 

0.1 1 

0.22* 

0.26 

0.38* 

0.42 

0.44 

N/A 

N/A 

0.08 

0.09 

N/A 

N/A 

93 

P 

-0.01 

0.01 

0.04 

0.05 

0.42 

0.41 

0.81 

0.81 

0.08 

0.08 

N/A 

N/A 

96 

M 

-0.02 

0.00 

0.02 

0.03 

0.49 

0.48 

N/A 

N/A 

0.1 1 

0.09* 

0.09 

0.09 

96 

F 

-0.10 

-0.08 

0.01 

0.03 

0.50 

0.50 

N/A 

N/A 

0.1 1 

0.09 

0.03 

0.03 

98 

K 

0.17 

0.14 

0.00 

-0.03 

0.53 

0.53 

N/A 

N/A 

0.19 

0.20 

0.42 

0.40 

98 

J 

0.30 

0.30 

0.26 

0.25 

0.47 

0.48 

0.82 

0.81 

0.03 

0.03 

N/A 

N/A 

99 

C 

-0.1 1 

-0.1 1 

0.09 

0.1 1 

0.46 

0.45 

0.66 

0.67 

0.08 

0.06 

0.00 

0.00 

99 

0 

-0.06 

-0.06 

0.07 

0.06 

0.42 

0.42 

N/A 

N/A 

0.1 5 

0.16 

0.02 

0.03 

99 

U 

0.04 

0.07 

-0.08 

-0.03 

0.52 

0.50 

0.91 

0.91 

0.05 

0.05 

0.28 

0.28 

>99 

A 

0.86 

0.89 

0.87 

0.93 

0.42 

0.43 

N/A 

N/A 

0.04 

0.05 

N/A 

N/A 

>99 

R 

0.29 

0.33 

0.33 

0.33 

0.43 

0.44 

0.81 

0.83 

N/A 

N/A 

N/A 

N/A 

>99 

1 

-0.09 

-0.03* 

-0.05 

-0.02 

0.54 

0.55 

0.88 

0.87 

0.03 

0.02 

N/A 

N/A 

>99 

B 

-0.01 

0.02 

0.00 

0.03 

0.49 

0.50 

0.87 

0.88 

0.14 

0.12 

0.33 

0.32 

>99 

V 

-0.05 

-0.01 

0.01 

0.07 

0.54 

0.49 

0.83 

0.85 

0.14 

0.14 

0.06 

0.07 

>99 

S 

0.24 

0.25 

0.21 

0.23 

0.46 

0.47 

0.91 

0.90 

0.02 

0.01 

N/A 

N/A 

>99 


"Difference between the CMO and comparison student group is statistically significant at the five percent ievei 
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APPENDIX E 


METHOD FOR DEALING WITH GRADE REPETITION 

Grade repetition — students retained in a given grade for an additional year, generally because 
they have not accumulated sufficient knowledge to progress to the next grade — complicates the 
analysis of academic achievement presented in Chapter Four. Retained students no longer take the 
same test at the same time as the other students in their cohorts. Since most state assessments are 
not vertically scaled, the scores of students who are retained cannot be directly compared with the 
scores of others in their original cohort who have progressed to the next grade, essentially resulting 
in a missing data problem. One approach to handling this missing data problem would be to drop 
the grade repeaters from the analysis. However, if CMO students repeat grades at different rates 
than comparison students, this may bias our impact estimates. Another option would be to include 
the grade repeaters’ scores as they are, a year later than the comparison students, although this 
would ignore the fact that grade repeaters receive two years to learn material taught to the rest of 
their cohort in a single year. 

Instead, following Tuttle et al (2010), we use information from the past performance of grade 
repeaters to keep them in the analysis. For each grade repeater, in the first year of repetition and 
subsequent years, we impute a score on the cohort-appropriate (rather than grade-appropriate) 
assessment that is equal to the student’s last standardized score prior to the initial instance of grade 
repetition. In other words, we assume that each retained student does neither better nor worse 
(relative to his/her original cohort) than before retention. If the CMO has a positive impact on the 
achievement of grade repeaters and a higher rate of grade repetition than comparison students, this 
will generally produce a conservative estimate of the CMO’s impact. If the CMO has a negative 
impact on the achievement of grade repeaters and a lower rate of grade repetition than comparison 
students, this would cause us to overestimate the CMO’s tme impact. In either case, the adjustment 
is arguably conservative, in the sense that our impact estimates will be biased toward zero. 

Of the 22 CMOs for which we are estimating middle school achievement impacts, 16 have rates 
of grade repetition that differ from rates in their local districts by three percentage points or less; we 
therefore would not expect the assumption about the scores of grade repeaters to make much of a 
difference for most of the CMOs. Five CMOs have notably higher rates of grade repetition than 
their districts, and all five of these have positive math and reading two-year impact estimates. The 
one CMO with a substantially lower rate of grade repetition than its local district has negative math 
and reading two-year impact estimates. If CMOs’ effects on grade repeaters are similar to their 
effects on non-repeaters, the impact estimates of all six of these CMOs may be slightly biased 
toward zero as a result of assuming that their students’ scores are unchanged after grade repetition. 
(Alternatively if the CMOs with positive impacts on non-repeaters have negative impacts on 
repeaters, this method could inflate our estimates of impacts). Appendix Table C.l shows the rates 
of grade repetition for students in each of 22 CMOs with middle school grades and the 
corresponding rates for students in their local districts. 
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Table E.l. Mean Rates of Grade Repetition in Middle School Grades, by Treatment Status 


CMO 

CMO Mean Rate of 
Grade Repetition 

District Mean Rate of Grade 
Repetition 

Difference in Rates 
(CMO - District) 

K 

.03 

.16 

-.12** 

0 

.02 

.05 

-.03** 

P 

.03 

.04 

-.01* 

J 

.02 

.02 

-.01 

E 

.01 

.02 

-.01 

L 

.00 

.01 

.00* 

T 

.00 

.00 

b 

o 

A 

.01 

.01 

.00 

H 

.01 

.01 

.00 

G 

.01 

.00 

.00 

U 

.01 

.00 

.01* 

Q 

.02 

.01 

.01 

B 

.02 

.00 

.02** 

N 

.02 

.00 

.02** 

D 

.04 

.02 

.03** 

F 

.06 

.03 

.03** 

R 

.05 

.02 

.03** 

C 

.13 

.05 

.07** 

M 

.12 

.03 

.09** 

1 

.14 

.03 

.11** 

S 

.14 

.02 

.12** 

V 

.1 7 

.03 

.14** 


Source; Administrative data 

Note: Type text here. 

* Significantly different from zero at the .05 level, two-taile d test 
** Significantly different from zero at the .01 level, two-taile d test. 
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APPENDIX F 


MULTIPLE COMPARISON ADJUSTMENTS FOR IMPACT ANALYSES 


Following the procedures and standards of the What Works Clearinghouse we used the 
Benjamini-Hochberg method to adjust for multiple comparisons for our math and reading impact 
estimates. The method protects against Type 1 errors and specifically against falsely detecting an 
impact when multiple impact hypotheses are tested. Table G.l below presents the p-values of our 
two-year math and reading impact estimates and the corresponding critical p-values (cutoff) for each 
statistically significant impact estimate based on our pre-specified level of statistical significance of 5 
percent. We observe that all impacts that were statistically significant at the 5 percent level remain 
statistically significant after applying the Benjamini-Hochberg correction. 


Table F.I. P- Values of Two-Year Math and Reading CMO Impacts and Corresponding Critical P- 
Values after Benjamin!- Hochberg Correction for Multiple Comparisons 


CMO 

Math 

Reading 

p-value 

Critical p 

p-value 

Critical p 

A 

0 . 00 ** 

0.03 

0.07 

N/A 

B 

0 . 00 ** 

0.02 

0 . 01 ** 

0.03 

C 

0 . 00 ** 

0.00 

0 . 00 ** 

0.02 

D 

0 . 01 * 

0.04 

0 . 01 * 

0.04 

E 

0.66 

N/A 

0 . 00 ** 

0.03 

F 

0 . 00 ** 

0.01 

0.07 

N/A 

G 

0 . 01 * 

0.04 

0 . 00 ** 

0.01 

H 

0 . 00 ** 

0.02 

0 . 00 ** 

0.01 

1 

0 . 01 ** 

0.04 

0 . 00 ** 

0.02 

J 

0 . 00 ** 

0.03 

0 . 00 ** 

0.00 

K 

0 . 00 ** 

0.00 

0 . 00 ** 

0.00 

L 

0.35 

N/A 

0 . 00 ** 

0.01 

M 

0 . 00 ** 

0.01 

0 . 00 ** 

0.03 

N 

0 . 00 ** 

0.02 

0 . 00 ** 

0.02 

0 

0 . 00 ** 

0.03 

0.1 5 

N/A 

P 

0 . 00 ** 

0.03 

0 . 00 ** 

0.02 

Q 

0 . 00 ** 

0.03 

0.06 

N/A 

R 

0.22 

N/A 

0.70 

N/A 

S 

0 . 00 ** 

0.01 

0 . 00 ** 

0.03 

T 

0 . 00 ** 

0.02 

0 . 01 ** 

0.03 

U 

0.65 

N/A 

0.14 

N/A 

V 

0 . 00 ** 

0.01 

0 . 00 ** 

0.01 


Notes: P-values that are in bold indicate impa cts that remain statistically significant a fter appiying 

the Bejamini-Hochbe rg correction for multiple comparisons. A critical p is only calculated for 
impacts that were statistically significant at the 5 percent level. Results insignificant before 
the correction remain insignificant. 

^Significantly different from zero at the .05 level, two-taile d test 
**Significantly different from zero at the .01 level, two-taile d test. 

References: 

U.S. Department of Education. What Works Clearinghouse Procedures and Standards Handbook Version 
2.1 . Washington, DC: U.S Department of Education, December 2008. 
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APPENDIX G 


IMPACTS ON MIDDLE SCHOOL TEST SCORES BY CMO, YEAR, AND SUBJECT 


Table G.I. Math Test- Score Impacts in Middle Schools, by CMO and Number of Years after 
Enrollment (Impact Units are Standard Deviation Effect Sizes) 


CMO 

1 -Year 

2-Year 

3-Year 

A 

-0.06 

-0.1 3** 

N/A 


(N=395) 

(N=179) 


B 

0.16* 

0.36** 

N/A 


(N=l,492) 

(N=l,042) 


C 

0.47** 

0.63** 

0.74** 


(N=710) 

(N=500) 

(N=31 5) 

D 

-0.12** 

-0.12* 

-0.15* 


(N=l,078) 

(N=837) 

(N=223) 

E 

0.03 

0.05 

-0.13** 


(N=502) 

(N=269) 

(N=67) 

F 

-0.02 

0.28** 

0.34** 


(N=l,157) 

(N=826) 

(N=482) 

G 

0.16** 

0.31* 

N/A 


(N=858) 

(N=534) 


H 

-0.35** 

-0.30** 

N/A 


(N=658) 

(N=499) 


1 

-0.14** 

0.12** 

0.22** 


(N=l,276) 

(N=961) 

(N=637) 

J 

0.00 

0.09** 

0.1 1* 


(N=916) 

(N=628) 

(N=357) 

K 

-0.27** 

-0.21** 

-0.23** 


(N=517) 

(N=403) 

(N=279) 

L 

0.03 

-0.02 

-0.09** 


(N=544) 

(N=409) 

(N=291) 

M 

0.35** 

0.50** 

0.48** 


(N=l ,41 7) 

(N=l,125) 

(N=858) 

N 

-0.1 1** 

-0.27** 

N/A 


(N=323) 

(N=207) 


0 

-0.04** 

-0.09** 

-0.1 1** 


(N=569) 

(N=422) 

(N=188) 

P 

0.12** 

0.17** 

N/A 


(N=792) 

(N=746) 


Q 

-0.1 9** 

-0.26** 

-0.05 


(N=489) 

(N=342) 

(N=208) 

R 

-0.13** 

-0.05 

-0.04 


(N=545) 

(N=428) 

(N=371) 

S 

0.39** 

0.40** 

0.27** 


(N=2,309) 

(N=l,766) 

(N=l,245) 

T 

0.58** 

0.42** 

N/A 


(N=773) 

(N=519) 


U 

0.10** 

0.02 

N/A 


(N=805) 

(N=449) 


V 

0.30** 

0.55** 

0.72** 


(N=481) 

(N=343) 

(N=226) 

Number of CMOs 

22 

22 

14 
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Table G.l (continued) 


Source: State, district and CMO school records 

Notes: To account for sam pie differences across outcomes and ou r propensity score matching 

approach which ailowed for multiple matches, student-level observations were weighted such 
that collectively, the treatment group has the same weight as the matched comparison grou p. 
Since we matched studen ts separately within each matching strata (district/state-cohort- 
grade), treatment and matched comparison students were also weighted proportionally within 
each strata. The sample sizes in the table report the number of unique treatment students for 
each CMO. Additionally, we calculated robust standard errors that were clustered at the school 
level. Refer to Appendix C for more details on the impact estimation procedure. 

’■Significantly different from zero at the .05 level, two-tailed test. 

**Significantly different from zero at the .01 level, two-tailed test. 
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Table G.2. Reading Test- Score Impacts in Middie Schoois, by CMO and Number of Years after 
Enroiiment (Impact Units are Standard Deviation Effect Sizes) 


CMO 

1 -Year 

2-Year 

3-Year 

A 

-0.01 

(N=395) 

-0.09 

(N=179) 

N/A 

B 

0.09 

0.18** 

0.13 


(N=l,493) 

(N=l,052) 

(N=763) 

C 

0.14* 

0.22** 

0.31** 


(N=710) 

(N=500) 

(N=315) 

D 

-0.14** 

-0.10* 

-0.01 


(N=l,082) 

(N=853) 

(N=226) 

E 

-0.10** 

-0.1 3** 

-0.1 1 


(N=503) 

(N=269) 

(N=71) 

F 

-0.10** 

-0.05 

0.00 


(N=l,161) 

(N=824) 

(N=478) 

G 

0.17** 

0.20** 

0.28** 


(N=860) 

(N=548) 

(N=304) 

H 

-0.14** 

-0.1 5** 

-0.30** 


(N=660) 

(N=509) 

(N=376) 

1 

-0.05 

0.13** 

0.1 3** 


(N=l,289) 

(N=970) 

(N=641) 

J 

0.07** 

0.18** 

0.10** 


(N=916) 

(N=627) 

(N=358) 

K 

-0.20** 

-0.1 7** 

-0.21** 


(N=523) 

(N=404) 

(N=280) 

L 

-0.1 5** 

-0.10** 

-0.25** 


(N=544) 

(N=409) 

(N=291) 

M 

0.07 

0.22** 

0.31** 


(N=l,416) 

(N=l,126) 

(N=857) 

N 

-0.14** 

-0.22** 

0.00 


(N=325) 

(N=208) 

(N=131) 

0 

-0.05 

-0.07 

-0.03 


(N=570) 

(N=423) 

(N=189) 

P 

0.12** 

(N=791) 

0.16** 

(N=748) 

N/A 

Q 

-0.10 

-0.13 

-0.06 


(N=489) 

(N=343) 

(N=208) 

R 

-0.07** 

0.01 

0.03 


(N=545) 

(N=426) 

(N=374) 

S 

0.10** 

0.08** 

0.03 


(N=2,318) 

(N=l,770) 

(N=l,254) 

T 

0.1 5** 

0.24** 

0.30* 


(N=772) 

(N=522) 

(N=389) 

U 

0.03** 

0.06 

0.16* 


(N=925) 

(N=621) 

(N=401) 

V 

0.06 

0.23** 

0.22** 


(N=482) 

(N=343) 

(N=225) 

Number of CMOs 

22 

22 

20 


Source: State, district and CMO school records 

Notes: To account for sam pie differences across outcomes and ou r propensity score matching 

approach which allowed for multiple matches, student-level observations were weighted such 
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Table G.2 (continued) 


that collectively, the treatment group has the same weight as the matched comparison grou p. 
Since we matched studen ts separately within each matching strata (district/state-cohort- 
grade), treatment and matched comparison students were also weighted proportionally within 
each strata. The sample sizes in the table report the number of unique treatment students for 
each CMO. Additionally, we calculated robust standard errors that were clustered at the school 
level. Refer to Appendix C for more details on the impact estimation procedure. 


*Significantly different from zero at the .05 level, two-tailed test. 
**Significantly different from zero at the .01 level, two-tailed test. 
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Table G3. Science Test- Score Impacts in Middle Schools, by CMO and Number of Years after 
Enrollment (Impact Units are Standard Deviation Effect Sizes) 


CMO 

3-Year 

A 

N/A 

B 

0.21 

(N=744) 

C 

N/A 

D 

N/A 

E 

-0.1 7 * ** 
(N=67) 

F 

N/A 

G 

0.61** 

(N=301) 

H 

-0.49** 

(N=367) 

1 

0.03 

(N=104) 

J 

0.31** 

(N=352) 

K 

N/A 

L 

-0.1 1** 
(N=188) 

M 

N/A 

N 

-0.1 1 
(N=125) 

0 

N/A 

P 

N/A 

Q 

N/A 

R 

0.06 

(N=350) 

S 

0.32** 

(N=l,004) 

T 

N/A 

U 

0.01 

(N=201) 

V 

N/A 

Number of CMOs 

11 


Source: State, district and CMO school records 

Notes; To account for sam pie differences across outcomes and ou r propensity score matching 
approach which allowed for multiple matches, student-level observations were weighted such 
that collectively, the treatment group has the same weight as the matched comparison grou p. 
Since we matched studen ts separately within each matching strata (district/state-cohort- 
grade), treatment and matched comparison students were also weighted proportionally within 
each strata. The sample sizes in the table report the number of unique treatment students for 
each CMO. Additionally, we calculated robust standard errors that were clustered at the school 
level. Refer to Appendix C for more details on the impact estimation procedure. 

*Significantly different from zero at the .05 level, two-tailed test. 

**Significantly different from zero at the .01 level, two-tailed test. 
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Table G^. Social Studies Test- Score Impacts in Middle Schools, by CMO and Number of Years after 
Enrollment (Impact Units are in Standard Deviation Effect Sizes) 


CMO 

3-Year 

A 

N/A 

B 

0.12 

(N=747) 

C 

N/A 

D 

N/A 

E 

-0.02 

(N=68) 

F 

N/A 

G 

0.22** 

(N=307) 

H 

-0.48** 

(N=371) 

1 

N/A 

J 

0.40** 

(N=351) 

K 

N/A 

L 

N/A 

M 

N/A 

N 

-0.03 

(N=128) 

0 

N/A 

P 

N/A 

Q 

N/A 

R 

0.15* 

(N=350) 

S 

0.1 9** 
(N=l,004) 

T 

N/A 

U 

0.31** 

(N=203) 

V 

N/A 

Number of CMOs 

9 


Source: State, district and CMO school records 

Notes; To account for sam pie differences across outcomes and ou r propensity score matching 
approach which ailowed for muitiple matches, student-level observations were weighted such 
that collectively, the treatment group has the same weight as the matched comparison grou p. 
Since we matched studen ts separately within each matching strata (district/state-cohort- 
grade), treatment and matched comparison students were also weighted proportionally within 
each strata. The sample sizes in the table report the number of unique treatment students for 
each CMO. Additionally, we calculated robust standard errors that were clustered at the school 
level. Refer to Appendix C for more details on the impact estimation procedure. 

•'Significantly different from zero at the .05 level, two-tailed test. 

**Significantly different from zero at the .01 level, two-tailed test. 
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APPENDIX H 


COMPARING CMO AND INDEPENDENT CHARTER IMPACTS 

Many CMOs began as attempts to scale-up educational approaches that appeared effective at 
individual charter schools. Reporting nationwide CMO impacts raises questions about how those 
impacts differ from impacts of charter schools not part of CMOs (here labeled independent charters). 
Are more effective charter approaches the ones that are scaled by CMOs? Do CMOs’ scale provide 
them with advantages or disadvantages over independent charters? To provide a comparison, we 
estimated impacts for CMOs and independent charters in four large school districts. Clearly, any 
differences between CMO and independent charter impacts in these districts cannot be reliably 
attributed to CMO management; there are many potential differences between sample CMO schools 
and independent charter schools that are not inherently due to CMO management (for example, 
CMO schools may be older than independent charter schools or have different intake grades).^^ 
Instead, the comparison provides a non-causal reference for CMO impacts. The diversity of CMO 
impacts suggests that any comparison with independent charters will likely vary, and impacts across 
four districts indicate that is the case — some CMO had higher impacts than independent charters in 
the same district and some CMOs had lower impacts. 

1. Data and Methods 

a. Sample and Data 

The comparison reported in this appendix involves CMO and independent charter schools in 
four districts. These districts were selected for having the largest number of independent charter 
middle schools from among all the states and districts who provided student records data that 
indicated whether students attended a charter school. Limiting to jurisdictions with clear and clean 
charter school indicators ensured that we could reliably identify all charters schools in the district. 
The four districts include 3 of the largest 20 school districts in the United States and are located in 
three of the four national census regions. 

As with the overall impact analysis, this comparison is limited to middle schools. Each of the 
four districts included three or more independent charter middle schools.^ District A had three 
CMOs with middle schools. Districts B and C had middle schools from two CMOs, and District D 
had middle schools from one CMO. 

b. Methods 

Impacts were estimated using OLS regression with statistical controls for baseline achievement 
and demographic characteristics. The validation exercise (Appendix B) indicated that OLS 
regression estimated impacts very close to experimental impact estimates. Specifically, the OLS 
estimates were usually within .03 of the experimental estimates, and the correlation with the 


In addition, these results cannot be generali2ed to all CMOs or independent charters since this sample of 
districts was not representative, but a convenience sample chosen based on data availability. 

The exact number of independent charters in each district is not revealed to prevent identification of districts. 
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experimental estimates was .99 for math and .88 for reading. Given that OLS provides rigorous 
impact estimates with more efficient estimation than the propensity score matching used to estimate 
primary study impacts (OLS does not require a matching process), we chose to use OLS.^^ 

For each district, impacts were estimated for all independent charter middle schools (pooled 
estimate) as well as each CMO’s middle schools. To estimate impacts, we compared reading and 
math achievement outcomes of CMO/independent charter enroUees to those of CMO/independent 
charter non-enroUees^*" in the same district and grade during the same year, controlling for students’ 
previous test scores and demographic characteristics. The impact model is: 

(1) Yi — +^iP + 8Tj+£i, 


where y; is the reading or math test score outcome for student i two years after the most 
common enrollment grade (for example, if the most common enrollment grade at a school was fifth 
grade, this would be the sixth grade) a is the intercept; X; is a vector of previous achievement and 
demographic characteristics;^* T; is a binary variable for treatment status, indicating whether student i 
was enrolled in a CMO/independent charter during intake grade; s is a random error term that 
reflects the influence of unobserved factors on the outcome; and p and 8 are vectors of parameters 
or parameters to be estimated. The estimated coefficient on treatment status, 8, is the two-year 
impact estimate. 


We addressed school attrition, grade retention, and missing data in the same manner as the 
primary impact analysis (see Chapter IV and Appendices C and E). Since CMO and district 
independent charter estimates includes multiple cohorts, it is possible that one cohort of comparison 
students but a different cohort of charter students could drive the results (the number of charter 
students generally increases with the length of time the school has been operating). Like the primary 
impact analysis, we used a weighting approach to make the contribution of each cohort to the 
impact estimate proportional to charter student (treatment group) sample size. Specifically, each 
CMO/independent charter student was assigned a weight of 1 and each comparison student was 

■ N CHARTER 


assigned a weight of 


^district' 


where N 


CHARTER 


and N 


DISTRICT 


are charter and district sample sizes by 


For each of the eight CMOs, the propensity-score impact estimates from the primary analysis and OLS impact 
estimates were always within .06 standard deviations except for the ELA estimate for CMO 8. For this CMO, the 
propensity estimates included a school outside the district and that may partially explain the difference. 

Independent charter schools were eligible comparison students for CMO and CMO students were eligible 
comparison students for independent charter students. In both cases, charter students were a small portion of the 
comparison group. 

As with the primary impact analysis in this study, two year impacts were chosen because they estimate more 
relevant longer-term impacts; three-year or four-year impacts were not possible because some students took course- 
specific math tests in their third and fourth years and the course taken could be affected by treatment. 

The characteristics were: pre-baseline (two grades prior to entry) and baseline test scores in reading and math 
with corresponding missing test score indicators; race /ethnicity; poverty status (as measured by eligibility for free or 
reduced-price lunches); special education status, ELL status, and whether the student was enrolled in a charter school at 
baseline. If the charter schools in the analysis had multiple entry grades, a covariate for entry grade was also included in 
the analysis. 
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grade (if multiple intake grades), cohort, and outcome (reading or math). Weights were rescaled such 
that the sum of weights equaled the total number of students in the analysis. 

2. Results 

In the one district with three CMOs, CMO middle schools generally had more positive impacts 
than independent charters, and in the other three districts CMOs generally had more negative 
impacts than independent charters. Although the trend slightly favored independent charters, that 
may simply result from the districts sampled, a convenience sample not randomly selected.^^ Two 
CMOs had larger positive impacts than the average independent charter in their district. Five CMOs 
had negative impacts in districts where the average independent charter impact was positive. As 
noted, any observed differences between CMO and independent charter impacts may not be caused 
by CMO management, but may result from other differences between CMOs and independent 
charters in the sample districts. 

Table H.I. CMO and independent Charter Impacts in Four Districts 


Math Reading 



* Significantly different from zero at the .05 level, two tailed test 
** Significantly different from zero at the .01 level, two tailed test 


For example, five of the eight CMOs in this sample (about 63 percent) had statistically significant negative 
impacts in math, but in the primary impact analysis only 7 of 22 did (approximately 32 percent). 
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APPENDIX I 


SUBGROUP IMPACTS 

We examined whether CMO schools have a differential impact on two-year math and reading 
test scores for five subgroups of students. The five subgroups are: males vs. females, African 
American vs. non-African American students, Hispanic vs. non-Hispanic students, students eligible 
for free- or reduced-price lunch (FRPL) vs. students who are not eligible for FRPL, and those below 
vs. above average on baseline test scores (where the average is based on the baseline scores of all 
students in either the district or state in the same subject). 

The subgroup analyses were performed on students who were in our main impact analysis 
sample by augmenting our CMO-specific impact model with an interaction between treatment 
group status and the subgroup indicator of interest. See Appendix C for details on our CMO- 
specific impact model. The standard errors were adjusted for clustering of students within schools. 

Due to their exploratory nature, we did not adjust subgroup impact results for multiple 
comparisons. Furthermore, some subgroup analyses were performed in a limited number of CMOs 
due to insufficient data (for example, missing FRPL status) or homogeneous CMO student 
populations (defined as less than 15% or greater than 85% of the treatment group sample belonged 
to a specific subgroup). Thus, these results should be interpreted with caution. 

Overall, we found little evidence that the CMO-specific impacts consistently vary by subgroup. 
However, as noted in chapter IV, for math, Hispanics had significantly larger impacts than non- 
Hispanics in five out of the nine CMOs for which we could estimate impacts for Hispanics; 
Hispanics had significantly smaller impacts in only one of these CMOs. And in reading, Hispanics 
had significantly larger impacts in four out of nine of these CMOs and there were no CMOs where 
Hispanics had significantly smaller impacts. 

Tables I.l and 1.2 below present detailed results of all five subgroup differences in impacts on 
two-year math and reading test scores, respectively. 
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Table 1.1. Subgroup Differences in Impacts on Students' Two- Year Math Test Score, by CMO 


CMO 

Male 

(vs. Female) 

African American 
(vs. non-Afric an 
American) 

Hispanic 
(vs. non- 
Hispanic) 

FRPL-e ligible 
(vs. non FRPL- 
eligible) 

Below Average 
Baseline 
Achievement” 
(vs. Above 
Average Baseline 
Achievement) 

A 

-0.01 

N/A 

N/A 

N/A 

N/A*- 


(0.07) 





B 

-0.05 

N/A 

N/A 

N/A 

-0.07 


(0.04) 




(0.05) 

C 

-0.01 

N/A 

N/A 

N/A 

0.01 


(0.07) 




(0.06) 

D 

0.08 

0.16* 

-0.1 7* 

-0.18* 

0.02 


(0.07) 

(0.07) 

(0.07) 

(0.09) 

(0.09) 

E 

0.1 1 

-0.13 

0.52** 

0.45** 

0.12 


(0.09) 

(0.13) 

(0.17) 

(0.11) 

(0.15) 

F 

-0.06 

N/A 

N/A 

N/A 

0.09 


(0.04) 




(0.06) 

G 

0.06 

N/A 

N/A 

0.01 

-0.06 


(0.03) 



(0.07) 

(0.10) 

H 

-0.01 

N/A 

0.02 

N/A 

0.09* 


(0.07) 


(0.05) 


(0.04) 

1 

0.04 

-0.18** 

0.18** 

N/A 

-0.10* 


(0.05) 

(0.04) 

(0.04) 


(0.05) 

J 

-0.06 

N/A 

N/A 

-0.01 

-0.08 


(0.03) 



(0.05) 

(0.06) 

K 

-0.05 

N/A 

0.1 5** 

N/A 

0.13** 


(0.03) 


(0.03) 


(0.03) 

L 

0.09* 

N/A 

N/A 

N/A 

0.05 


(0.04) 




(0.04) 

M 

0.02 

-0.17 

0.31* 

N/A 

0.06** 


(0.03) 

(0.11) 

(0.12) 


(0.02) 

N 

0.00 

N/A 

N/A 

N/A 

0.13 


(0.05) 




(0.07) 

0 

0.01 

-0.12* 

0.12* 

N/A 

0.13** 


(0.04) 

(0.05) 

(0.05) 


(0.02) 

P 

-0.02 

N/A 

N/A 

N/A 

0.01 


(0.05) 




(0.05) 

Q 

-0.24** 

0.08 

N/A 

0.05 

-0.12 


(0.09) 

(0.09) 


(0.13) 

(0.13) 

R 

0.10 

0.03 

0.01 

-0.1 1 

0.04 


(0.05) 

(0.08) 

(0.09) 

(0.11) 

(0.09) 

S 

0.01 

N/A 

N/A 

N/A 

-0.05 


(0.02) 




(0.08) 

T 

-0.04 

N/A 

N/A 

0.40* 

-0.05 


(0.06) 



(0.19) 

(0.06) 

U 

0.03 

N/A 

N/A 

N/A 

0.14 


(0.07) 




(0.15) 

V 

-0.06* 

0.02 

0.1 1 

N/A 

0.07 


(0.02) 

(0.06) 

(0.07) 


(0.06) 
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Table 1.1 (continued) 


Source: State, district and CMO school records 

Note: The differences in impacts of being enroiled in a CMO schooi by subgroups were estimated by 

including an appropriate interaction between treatment status and subgroup indicator in the 
impact regression model. Standard errors, presented in parenthesis, account for clustering of 
students within schools. CMOs that had less th an 1 5% or greater than 85% of students in a 
given subgroup in the analysis sample were excluded from these analyses. 

""Below Average Baseline Achievement students were defined as students who performed below the mean 
for their district or state on the baseline math test. 

'"Less than 1 5% of both treatment and matched comparison students were low achieving students. 

*Significantly different from zero at the 0.05 level, two-taile d test. 

**Significantly different from zero at the 0.01 level, two-taile d test. 
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Table 1.2. Subgroup Differences In Impacts on Students' Two- Year Reading Test Score, by CMO 


CMO 

Male 

(vs. Female) 

African American 
(vs. non-Afric an 
American) 

Hispanic 
(vs. non- 
Hispanic) 

FRPL-e ligible 
(vs. non FRPL- 
eligible) 

Below Average 
Baseline 
Achievement” 
(vs. Above 
Average Baseline 
Achievement) 

A 

-0.04 

N/A 

N/A 

N/A 

N/A*- 


(0.09) 





B 

-0.01 

N/A 

N/A 

N/A 

0.04 


(0.03) 




(0.04) 

C 

0.06 

N/A 

N/A 

N/A 

0.02 


(0.07) 




(0.07) 

D 

0.12* 

-0.02 

0.09 

-0.07 

-0.07 


(0.05) 

(0.07) 

(0.06) 

(0.13) 

(0.06) 

E 

-0.03 

0.34** 

0.04 

0.15* 

0.16* 


(0.07) 

(0.07) 

(0.06) 

(0.06) 

(0.06) 

F 

-0.03 

N/A 

N/A 

N/A 

-0.07 


(0.05) 




(0.05) 

G 

0.00 

N/A 

N/A 

0.08 

0.02 


(0.03) 



(0.05) 

(0.06) 

H 

-0.10 

N/A 

0.14** 

N/A 

0.09 


(0.09) 


(0.04) 


(0.06) 

1 

-0.01 

0.01 

0.00 

N/A 

-0.1 1* 


(0.03) 

(0.05) 

(0.05) 


(0.05) 

J 

-0.10** 

N/A 

N/A 

-0.09 

0.10** 


(0.04) 



(0.05) 

(0.03) 

K 

0.01 

N/A 

N/A 

0.36** 

0.39 


(0.06) 



(0.08) 

(0.26) 

L 

0.19** 

N/A 

N/A 

N/A 

0.02 


(0.04) 




(0.04) 

M 

0.01 

-0.17 

0.25** 

N/A 

0.03 


(0.02) 

(0.09) 

(0.09) 


(0.05) 

N 

-0.05 

N/A 

N/A 

N/A 

0.01 


(0.06) 




(0.06) 

0 

0.08 

-0.19** 

0.19** 

N/A 

-0.02 


(0.06) 

(0.06) 

(0.06) 


(0.03) 

P 

0.03 

N/A 

N/A 

N/A 

0.08* 


(0.04) 




(0.04) 

Q 

-0.13 

0.00 

N/A 

0.00 

-0.20* 


(0.09) 

(0.13) 


(0.12) 

(0.09) 

R 

0.01 

0.18** 

-0.02 

0.13 

-0.08 


(0.05) 

(0.07) 

(0.08) 

(0.10) 

(0.08) 

S 

0.00 

N/A 

N/A 

N/A 

0.01 


(0.03) 




(0.03) 

T 

0.00 

N/A 

0.03 

N/A 

0.04 


(0.03) 


(0.03) 


(0.03) 

U 

0.03* 

N/A 

N/A 

N/A 

0.1 5** 


(0.01) 




(0.02) 

V 

-0.01 

-0.07* 

0.14** 

N/A 

-0.02 


(0.06) 

(0.03) 

(0.03) 


(0.05) 
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Table 1.2 (continued) 


Source: State, district and CMO school records 

Note: The differences in impacts of being enroiled in a CMO schooi by subgroups were estimated by 

including an appropriate interaction between treatment status and subgroup indicator in the 
impact regression model. Standard errors, presented in parenthesis, account for clustering of 
students within schools. CMOs that had less th an 1 S% or greater than 85% of students in a 
given category in the anaiyses sample were excluded from these analyses. 

""Below Average Baseline Achievement students were defined as students who performed below the mean 
for their district or state on the baseline reading test 

'"Less than 1 5% of both treatment and matched comparison students were low achieving students 

*Significantly different from zero at the 0.05 level, two-taile d test. 

**Significantly different from zero at the 0.01 level, two-taile d test. 
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APPENDIX J 

METHODS FOR CORRELATING IMPACTS AND CMO CHARACTERISTICS 


Chapter V presents our analysis of the relationship between CMO impacts on student 
achievement and various CMO practices, stmctures, and contextual factors. These analyses include 
both (1) bivariate associations of impacts against individual CMO characteristics, and (2) multivariate 
analyses where impacts are regressed on several CMO characteristics. 

The bivariate analyses make use of bivariate ordinary least squares (OLS) models to gauge the 
correlation between student impacts and CMO practices. The outcome variables are the estimated 
two-year middle school impacts of the CMO on students’ achievement in reading and math. We 
treat each estimated impact as if it were the result of a mini-study, using robust standard errors to 
account for the fact that our outcome variable is an estimated parameter, rather than an observed 
value (Lewis and Linzer 2005). We present the results of two-tailed tests throughout. Each of our 
primary hypotheses and many of our secondary hypotheses were specified with reference to a single 
direction of association. For these hypotheses, one could argue that one-tailed tests would be 
appropriate. Nonetheless, to be conservative, we present the results of two-tailed tests. (If one 
believes a one-tailed test is appropriate, one could divide the p values and significance thresholds in 
half). 

The multivariate analyses employ multivariate OLS regressions. Again we use robust standard 
errors and present the results of two-tailed tests. 

The independent variables in the correlational analyses are CMO practices, stmctures, and 
contextual factors. Most of these variables are constructed from responses to the Principal Survey. 
For these analyses, we have data for 19 CMOs and 219 original CMO principal responses (or 294 
responses after imputation); these are the CMOs for which we have both estimates of impacts and 
principal survey data. For some secondary hypotheses and intermediate outcomes, we also make use 
of measures constmcted from responses to the CMO Central Office Staff Survey (for which we 
have 17 CMOs that also have estimated impacts) and the Teacher Survey (for which we have data 
for 12 CMOs from 384 teachers). 

The association between CMOs’ impacts and their practices is meaningful only if there is 
sufficient variation in impacts. A Q-test conducted on the 19 CMOs with impact data and data from 
the Principal Survey strongly rejected the null hypothesis of homogeneity of impacts (p<0.001 for 
both subjects), so it is appropriate to attempt to explain variation in student impacts with variation 
in CMOs’ practices. 

Variable Construction 

Measures drawn from the Teacher Survey and the CMO Central Office Staff Survey are 
constructed in the manner described in Chapter III. Each coefficient can be interpreted as the 
expected change in impacts for an increase of one standard deviation in the CMO (or CMO- 
average) report of that practice. 

For measures based on the Principal Survey, we are able to use responses from principals of 
both CMO schools and matched district comparison schools to generate a measure of the 
divergence between the practices of CMO schools and nearby district schools. This is comparable to 
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the conceptualization of student impacts, which estimate the difference between a student’s 
achievement in CMO schools as opposed to if she had attended a district school. We make use of 
the paired schools and create a school-level difference between the measure for the CMO principal 
and the measure for the district principal. These differences are then aggregated up to the CMO 
level. 

Imputation 

All data from the Principal Survey are multiply imputed to account for nonresponse. Multiple 
imputation is particularly valuable for measures drawn from the Principal Survey because they are 
constructed at the school-pair level. By multiply imputing, we are able to retain observations from 
school pairs in which one of the principals responded to the survey and the other did not. Multiple 
imputation also allows us to take into account uncertainty concerning the responses of 
nonresponding principals. If we constmcted a CMO-level average measure based only on the 
responses that we have, we would overstate our degree of certainty in the CMO average. In our 
multiply imputed dataset, this uncertainty will be reflected by somewhat different CMO-level 
averages in each imputation. 

We use five imputations and the ICE procedure in Stata. The imputation model is at the 
school-pair level. In other words, paired schools appear as a single observation in the dataset, with 
responses from the CMO school principal and the district school principal treated as separate 
variables within the observation. The imputation models include: 

• School traits: district of the comparison school (when possible), state, and school level 
(elementary, middle, high). 

• Demographic features of the CMO school and the comparison district school: pupil- 
teacher ratio, percent racial minority in the student body, and percent of the student 
body that is eligible for free or reduced-price lunch (FRPL). 

• Two-year middle school impacts in math and reading.30 

• Survey measures: survey response to the same item from the other school in the pair, 
survey response to other items in the same index by both schools in the pair (where 
appropriate). 


30 Including the outcome variable in the imputation model may seem circular, but is the preferred approach for 
reducing bias in estimated coefficients based on imputed data (Moons et al. 2006). 
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APPENDIX K: CORRELATIONAL ANALYSIS RESULTS 


U) 


Table K.1. CMO Impacts with Rankings on Baseline Scores, Size, and Practices 


CMO 

Math 

Impact 

Reading 

Impact 

Baseline 

Math 

Baseline 

Reading 

CMO 

Size 

Behavior 

Policy 

Formative 

Assessment 

Educational 

Approach 

Teacher 

Coaching 

Performance 
- Based 
Comp. 

Instruct. 

Time 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

(8) 

(9) 

(10) 

(11) 

c 

0.63** 

0.22** 

Medium 

Medium 

Small 

High 

Medium 

Low 

High 

Low 

High 

V 

0.55** 

0.23** 

Medium 

Medium 

Large 

High 

Low 

Medium 

High 

High 

High 

M 

0.50** 

0.22** 

Medium 

Medium 

Small 

N/A 

N/A 

N/A 

N/A 

N/A 

N/A 

T 

0.42** 

0.24** 

Medium 

Low 

Large 

Medium 

High 

High High 


High 

Medium 

S 

0.40** 

0.08** 

High 

High 

Small 

High 

Medium 

Low 

High 

Medium 

High 

B 

0.36** 

0.1 8** 

Medium 

Low 

Large 

Medium 

Medium 

Medium 

Medium 

Low 

Medium 

G 

0.31* 

0.20** 

Low 

Medium 

Large 

High 

Medium 

High 

Medium 

Medium 

Low 

F 

0.28** 

-0.05 

Low 

Medium 

Large 

High 

High 

Medium 

Medium 

Medium 

High 

P 

0.17** 

0.16** 

Medium 

Medium 

Large 

Medium 

High 

High 

High 

High 

Medium 

1 

0.12** 

0.1 3** 

Low 

Low Large 


Medium 

Low 

Low Medium 


High 

High 

J 

0.09** 

0.18** 

High 

High 

Large 

Medium 

Medium 

High 

High 

High 

High 

E 

0.05 

-0.1 3** 

High 

High Sm 

all 

N/A 

N/A 

N/A 

N/A 

N/A 

N/A 

U 

0.02 

0.06 

Low 

Low 

Large 

Medium 

Medium 

Medium 

Low 

Medium 

Medium 

L 

-0.02 

-0.10** 

High 

High Sma 

II 

High 

Low 

Low Medium 


Low 

Low 

R 

-0.05 

0.01 

High 

High 

Large 

Low 

Medium 

High 

Low 

High 

Medium 

0 

-0.09** 

-0.07 

Medium 

Medium Sma II 

Low 

High 

Medium 

Medium 

Low 

Low 

D 

-0.12* 

-0.10* 

Low 

Low 

Small 

Medium 

High 

N/A 

Medium 

Medium 

Medium 

A 

-0.13** 

-0.09 

High 

High Sm 

all 

Low 

Low 

High 

Low 

Low 

Low 

K 

-0.21** 

-0.1 7** 

High 

Low 

Small 

Low 

Low 

Medium 

Low 

Medium 

Low 

Q 

-0.26** 

-0.13 

Low 

Low Larg 

e 

N/A 

N/A 

N/A N/A 


N/A 

N/A 

N 

-0.27** 

-0.22** 

Low 

Low 

Small 

Low 

High 

Low 

Low 

Medium 

Medium 

H 

-0.30** 

-0.1 5** 

High 

High Large 


Low 

Low 

Low 

Low 

Low 

Low 












Table K.l (continued) 


Note: Columns 1 and 2 report year-two math and reading impacts. The rankings in Columns 3-4 and 6-11 correspond to the percentile 

rank of each CMO relative to the other CMOs in the sampie with data: CMO's in the bo ttom third receive a 'low' ranking, those in the 
middie third receive a 'medium' ranking, and those in the top third receive a 'high' ranking. Column 5 reports the size of the CMO, as 
measured by the total number of schools operated in the fall of 2009. CMOs operating more than 8 schools are classified as "ia rge," 
while those operating 8 or fewer schools are classified as "small." Column 10 shows rankings on Performance Based Compensation, 
and coiumn 1 1 shows rankings on Instructionai Time. 

* Significantiy different from zero at the 0.05 level, two-taile d test 
** Significantly different from zero at the 0.01 level, two-taile d test 



Table K.2. Correlations between Common Instructional Framework Components and Impacts 


Math Reading 


Number of districts where CMO has schools* 

0.01 

0.01 


(0.02) 

(0.01) 

Ratio of central office staff to teachers* 

-0.05 ^ 

-0.04 ** 


(0.02) 

(0.01) 

Ratio of HR and operations staff to teachers* 

-0.24 

-0.1 7 * 


(0.15) 

(0.07) 

Ratio of educational support staff to teachers* 

-0.09 ^ 

-0.07 ** 


(0.05) 

(0.03) 

Ratio of finance-r elated staff to teachers* 

-0.15* 

-0.1 2 ** 


(0.07) 

(0.04) 

Ratio of other central office staff to teachers* 

-2.16 

-1.09 


(3.17) 

(1.74) 

CMO specifies student behavior policies 

-0.06 

-0.05 


(0.10) 

(0.05) 

CMO provides system of assessments 

0.10 

0.05 


(0.08) 

(0.04) 

CMO sets teacher salaries and evaluation 

-0.01 

-0.02 


(0.10) 

(0.05) 

Frequency of CMO visits to school 

0.01 

0.01 


(0.11) 

(0.05) 

CMO provides professional development support 

0.19’^ 

0.12* 


(0.08) 

(0.05) 

CMO provides assistance in areas with weak test scores 

0.12 

0.05 


(0.11) 

(0.05) 

Principal previously worked in CMO 

-0.01 

-0.05 


(0.13) 

(0.06) 

Importance of sample teaching performance for teacher hiring 

-0.00 

0.03 


(0.11) 

(0.05) 

Importance of commitment to school mission for teacher hiring 

0.04 

0.00 


(0.08) 

(0.04) 

Fraction of teachers hired from TFA / Teaching Fellows*'’ 

0.97’'* 

0.28 


(0.27) 

(0.19) 

Teachers can earn tenure 

0.08 

0.04 


(0.09) 

(0.04) 

Principal can get bonus for student achievement results 

-0.06 

0.01 


(0.06) 

(0.04) 

Principal salary 

0.07 

0.07 


(0.08) 

(0.04) 

Teacher looping over grades 

0.04 

0.04 


(0.09) 

(0.04) 

Students instructed in math/ reading with students of similar ability 

-0.01 

-0.02 


(0.11) 

(0.04) 

Importance placed on students exceeding state academic standards 

0.16* 

0.08* 


(0.07) 

(0.03) 
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Table K.2 (continued) 



Math 

Reading 

Enrollment of school 

-0.00 

-0.02 


(0.08) 

(0.04) 

Teacher-student ratio in math / reading classes 

-0.03 

0.04 


(0.13) 

(0.05) 

New students required to attend summer session 

0.14*' 

0.07* 


(0.05) 

(0.03) 

Teachers are covered by collective bargaining agreement 

-0.09* 

-0.01 


(0.05) 

(0.04) 

Frequency of teacher observations by master teacher / teaching coach-' 

0.29'=* 

0.11“ 


(0.06) 

(0.06) 

Frequency of teacher observations by monitor of teacher performance-' 

0.21“ 

0.04 


(0.09) 

(0.07) 

Frequency of teachers receiving feedback in formal evaluation*' 

0.17 

0.03 


(0.14) 

(0.09) 

Frequency of teachers receiving feedback outside of formal evaluation-' 

0.26*’^ 

0.09 


(0.08) 

(0.06) 

Frequency of CMO staff observing classrooms-'' 

0.04 

-0.01 


(0.04) 

(0.02) 

Frequency of CMO staff meeting with teachers one-on-one -■' 

0.00 

0.00 


(0.05) 

(0.03) 

Frequency of CMO staff meeting with principals one- on- one*'' 

0.27*" 

0.15** 


(0.08) 

(0.04) 

Frequency of CMO staff performing school walkthroughs*'' 

0.15“ 

0.07 


(0.08) 

(0.05) 

Frequency of CMO staff analyzing or explaining data in school*'' 

0.15“ 

0.08“ 


(0.08) 

(0.04) 

Source: State, district, and CMO school records. Principal Survey, 

Teacher Survey, and 

CMO Central Office Staff 


Survey. 

“Measure is on a natural scale (not standardized). 

'’Scale is 0-1 . 

'Scale is 1 -5 . 

■“Scale is 0-5. 

“Significantly different from zero at the .1 0 level, two- tailed test. 
^Significantly different from zero at the .05 level, two-tailed test. 
**Significantly different from zero at the .01 level, two- tailed test. 
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Table K3. Correlations between States' Autonomy Afforded to Charters and Impacts 


Math 


Reading 


Low (TX, IN, OH, NV, MD) (reference group) 


Medium (NY, IL, CO, CT, GA, LA, NJ) 



0.08 

0.01 


(0.18) 

(0.09) 

High (DC, FL, AZ, PA, OR, CA, MO) 

-0.01 

0.03 


(0.14) 

(0.08) 

Constant 

0.13 

0.02 


(0.09) 

(0.05) 


Source: State, district, and CMO school records, and data from the National Alliance for Public Charter 

Schools 

''Significantly different from zero at the .1 0 level, two-tailed test. 

*Significantly different from zero at the .05 level, two-tailed test. 

"'"'Significantly different from zero at the .01 level, two-tailed test. 


Table KA. Correlations between Behavior Policy Components and Impacts 


Math Reading 

Consistent school-wide behavioral standards and 


disciplinary policy 

0.15* * ** 

0.06 


(0.08) 

(0.05) 

Zero tolerance policies for potentially dangerous behaviors 

o.zr 

0.10* 


(0.09) 

(0.05) 

Schools have behavior code with student rewards 

0.14'' 

0.05 


(0.07) 

(0.04) 

Schools have behavior code with student sanctions 

0.17'* 

0.07* 


(0.06) 

(0.03) 

Parent or student signs responsibility agreement 

0.12 

0.06 


(0.09) 

(0.05) 


Source: State, district, and CMO school records. Principal Survey. 

^Significantly different from zero at the .10 level, two-taile d test. 

*Significantly different from zero at the .05 level, two-tailed test. 

**Significantly different from zero at the .01 level, two-talled test. 
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Table K.5. Correlations between Teacher Coaching Components and Impacts 




Math 

Reading 

Frequency with which new teachers observed 

by coaches 





0.12* 

0.05 



(0.06) 

(0.03) 

Frequency with which new teachers observed 

by principals / 

0.1 1 

0.03 

administrators 


(0.06) 

(0.04) 

Frequency with which new teach ers receive 

feedback from 

0.10 

0.04 

observers 


(0.08) 

(0.04) 

Frequency with which ne w teachers must 

submit lesson 

0.18 

0.08 

plans for review 


(0.10) 

(0.04) 


Source: State, district, and CMO school records. Principal Survey. 

^Significantly different from zero at the .1 0 level, two-tailed test. 
*Significantly different from zero at the .05 level, two-tailed test. 
**Significantly different from zero at the .01 level, two-tailed test. 
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Table K.6. Correlations between Instructional Coherence, Organizational Health, and Impacts 



Math 

Reading 

Instructional Coherence 

Use of a common instructional framework 

0.13* 

0.04 


(0.07) 

(0.05) 

Cooperative and supportive school environment 

0.08 

0.02 


(0.07) 

(0.05) 

Organizational Health 
High teacher job satisfaction 

0.01 

-0.01 


(0.09) 

(0.06) 

Stable administrative staff 

-0.02 

-0.05 


(0.13) 

(0.06) 

Number of applications for teaching positions 

-0.01 

-0.01 


(0.08) 

(0.04) 

Rate of student turnover 

0.00 

0.01 


(0.10) 

(0.06) 

Minimal legal, financial, and other administrative challenges 

0.03 

-0.00 


(0.14) 

(0.07) 


Source: State, district, and CMO school records. Principal Survey, and Teacher Survey. 


^Significantly different from zero at the .1 0 levei, two-tailed test. 
*Significantly different from zero at the .05 ievel, two-tailed test. 
**Significantly different from zero at the .01 level, two-tailed test. 
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Table K.7. Correlations between Common Instructional Framework Components and Impacts 



Math 

Reading 

Curriculum focuses on helping students meet learning 
standards 

0.31 

0.10 


(0.18) 

(0.12) 

Consistency among same-grade teachers in material taught 

0.57 

0.22 


(0.34) 

(0.20) 

Coordination across grades in material taught 

-0.01 

-0.07 


(0.24) 

(0.14) 

Teachers follow school-wide instructional calendar/ pacing 

0.27 

0.04 

plan 

(0.21) 

(0.15) 

Continuity across school instructional initiatives 

0.26 

0.05 


(0.21) 

(0.13) 

School has well-defined le arning standards for all students 

0.33'' 

0.12 


(0.17) 

(0.11) 

Consistent school-w ide grading and passing standards 

0.31 

0.09 


(0.20) 

(0.14) 

Curriculum has clear path for accelerating under-achieving 

0.36* 

0.10 

students 

(0.19) 

(0.13) 

Teachers have access to student assessment information 

0.30* 

0.10 


(0.11) 

(0.10) 

Teachers frequently modify lesson plans based on student 

0.62** 

0.28** 

test results 

(0.10) 

(0.07) 

School day organized to mini mize events that r educe 

0.33 

0.05 

instruction time 

(0.23) 

(0.15) 


Source: State, district, and CMO school records. Teacher Survey. 


Note: Each measure is on a 1-4 scale. 

^Significantly different from zero at the .1 0 level, two-tailed test. 
*Significantly different from zero at the .05 level, two-tailed test. 
**Significantly different from zero at the .01 level, two-tailed test. 
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Table K.8. Revising Lesson Plans as Mediator of Associations between Primary Hypotheses and 
Impacts 




Math 

Reading 

Teachers frequently modify lesson plans 

based on student 

0.62** 

0.30* 

test results 


(0.16) 

(0.11) 

Comprehensive behavior policy 


0.00 

-0.01 



(0.09) 

(0.06) 

Teachers frequently modify lesson plans 

based on student 

0.52* 

0.20'' 

test results 


(0.15) 

(0.10) 

Emphasize intensive teacher coaching 


0.1 1 

0.09 



(0.11) 

(0.07) 

Teachers frequently modify lesson plans 

based on student 

0.57** 

0.24* 

test results 


(0.11) 

(0.07) 

Instructional hours per year 


0.05 

0.04 



(0.06) 

(0.05) 


Source: State, district, and CMO school records. Principal Survey, and Teacher Survey. 


^Significantly different from zero at the .1 0 level, two-tailed test. 
*Significantly different from zero at the .05 ievel, two-tailed test. 
**Significantly different from zero at the .01 level, two-tailed test. 
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Table K.9. Multivariate Associations between Primary CMO Practices and Impacts 



Math 

Reading 

Comprehensive behavior policy 

0.15** 

0.06* 


(0.05) 

(0.04) 

Emphasize intensive teacher coaching 

0.12 

0.05 


(0.07) 

(0.05) 

Instructional hours per year 

0.04 

0.02 


(0.08) 

(0.06) 

Constant 

-0.00 

-0.01 


(0.06) 

(0.05) 


0.62 

0.38 


Source: State, district, and CMO school records. Principal Survey. 


''Significantly different from zero at the .1 0 level, two-tailed test. 
*Significantly different from zero at the .05 level, two-tailed test. 
**Significantly different from zero at the .01 level, two-tailed test. 
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Table K.10. Interactions between Intensive Teacher Coaching and Other Core Measures 



With Formative 
Assessment 

With Performance- based 
Compensation 


Math 

Reading 

Math 

Reading 

Intensive teacher coaching 

0.15 

0.06 

0.17 

0.07 


(0.10) 

(0.06) 

(0.11) 

(0.05) 

Emphasize formative assessments 

0.06 

0.04 




(0.11) 

(0.07) 



Intensive teacher coaching * 

-0.04 

-0.05 



Emphasize intensive teacher coaching 

(0.16) 

(0.08) 



Emphasize performance- based compensation 



-0.04 

0.01 




(0.11) 

(0.06) 

Intensive Coaching * 



0.06 

0.01 

Emphasize performance- based compensation 



(0.10) 

(0.05) 


Source: State, district, and CMO school records. Principal Survey. 


^Significantly different from zero at the .1 0 level, two-tailed test. 
•'Significantly different from zero at the .05 level, two-tailed test. 
**Significantly different from zero at the .01 level, two-tailed test. 
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